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PREFACE TO THIRD EDITION 

In preparing this third, ‘Var’’ edition of the Measurement of Adult 
Intelligence my primary aim has been to increase the book's usefulness 
to those engaged in the field of applied psychometrics. I have borne 
in mind particularly the military applications of the Wechsler-Bellevue 
Scale, as it is now being used by some of the armed services of our own 
country as well as of Great Britain. Accordingly, most of the changes 
made will be found in the chaptei’s dealing with Clinical applications 
and the Problem of menial deterioration. These chapters have been 
somewhat extended and in part rewritten. The table of ^signs' for 
different clinical entities has been completely revised and a new quan- 
titative method for determining mental deterioration elaborated. Both 
of these presentations should be of value not only in the field of neuro- 
psychiatric diagnosis but also in questions connected with rehabilitation. 

In addition to the above, some changes have been made in the scoring 
of certain subtests of the Scale. The effect of these changes will be to 
increase the I.Q. range at the upper end of the Scale, without however 
altering significantly the norms for the population as a whole. In the 
case of one or two tests (noticeably, the Arithmetic) the change has 
necessitated slight alteration in the scoring of some of the subtest items. 
Little alteration has otherwise been made in the substance of the text. 
One table and two figures which were not particularly useful have been 
omitted and two new tables which should be of immediate and practical 
value added. Also added is a bibliography of most of the important 
studies with the Wechsler-Bellevue Scale, published up to March 1944. 
For editorial perusal of the text I am indebted to Miss Ventura Smith, 
psychologist, to the Westport, Conn., public schools, who not only read 
the page proofs but was also kind enough to prepare a new index. 

David Wechsler. 

April, 1944 
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Paht I 

THE NATURE AND CLASSIFICATION 
OP INTELLIGENCE 




CHAPTER 1 

THE NATURE OF INTELLIGENCE 


Some years ago when interest in intelligence tests was at its height, 
a prominent psychologist is reported to have answered an inquiry as 
to what he meant by intelligence by saying that it is what intelligence 
tests measure. A similar attitude would not be maintained today by 
any considerable number of psychologists. But the continued failure 
of authors of intelligence tests to declare explicitly what they under- 
stand by general intelligence would almost compel one to assume that 
they still maintain this circular position. The lay person is entirely 
justified in asking, as he does, ^TIow do you know that your test meas- 
ures intelligence?^ ^ and every author of a test should be ready to answer 
the question, however imperfectly. Obviously, the more data the psy- 
chologist has, the easier his task will be. But he will be able to make 
no answer at all unless there is some provisional agreement between him 
and his challenger as to what they are willing to call intelligence, or 
at least intelligent behavior. We shall, therefore, begin by giving our 
own definition of intelligence and then consider its relation to the more 
important current theories on the subject. 

Intelligence is the aggregate or global capacity of the individual to act 
purposefully^ omd to deal effectively with hi^^ environ^ 

ment It is globaP because it characterizes the individual's behavior as 
a whole; it is an aggregate because it is composed of elements or abilities 
which, though not entirely independent, are qualitatively differentiable. 
By measurement of these abilities, we ultimately evaluate intelligence. 
But intelligence is not identical with the mere sum of these abilities, 
however inclusive. ‘There are three important reasons for this: (1) 
The ultimate products of intelligent behavior are not only a function of 
the number of abilities or their quality but also of thojya^Jn which they 
are combined, that is, upon their configuration. (2X Factors other 
than intellectual ability, for example, those of drive and incentive, enter 
into intelligent behavior. (3) Finally, while different orders of intelli- 
gent behavior may require varying degrees of intellectual ability, an 
excess of any given ability may add relatively little to the effectiveness 

^ In the adopted French sense of the term, ^ ^pertaining to or embracing the 
totality of a group of items or categories.*" Cf. Oxford Dictionary. 
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of the behavior as a whole. It would seem that, so far as general in- 
telligence is concerned, intellectual ability as such merely enters as a 
necessary minimum. Thus,''to act intelligently, one must be able to 
recall numerous items — i.e., have a retentive memory. But beyond a 
certain point this ability will not help much in coping with life situations 
successfully. This is true of even more important capacities, such as 
the ability to reason, particularly when specialized. The unusual 
reasoning abilities of the mathematician are more highly correlated with 
the thing that we ultimately measure as intelligence than sheer memory 
is, but possession of this ability is no surety that behavior as a whole 
will be very intelligent in the sense above defined. Every reader will 
be able to recall persons of high intellectual ability in some particular 
field, whom they would unhesitatingly characterize as below average 
in general intelligence. 

Although intelligence is no mere sum of intellectual abilities, the only 
way we can evaluate it quantitatively is by the measurement of the 
various aspects of these abilities. There is no contradiction here unless 
we insist upon the identity of general intelligence and intellectual abil- 
ity. We do not, for example, identify electricity with our modes of 
measuring it. Our measurements of electricity consist of quantitative 
records of its chemical, thermal and magnetic effects. But these effects 
are not identical with the “stuff^' which produced them. General in- 
telligence, like electricity, may be regarded as a kind of energy. We do 
not know what the ultimate nature of this energy is, but as in the case 
of electricity, we know it by the things it does or, better, by the things 
it enables us to do — ^such as making appropriate associations between 
events, drawing correct inferences from propositions, understanding the 
meaning of words, solving mathematical problems or building bridges. 
These are the effects of intelligence in the same sense as chemical dis- 
sociation, heat, and magnetic fields are the effects of electricity, but 
psychologists prefer the term mental products. We know intelligence 
by what it enables us to do. 

Professor Thorndike was the first to develop clearly the idea that the 
measurement of intelligence consists essentially of some qualitative and 
quantitative evaluation of mental productions in terms of their num- 
ber, and the excellence or speed with which they are effected. That is 
the only function which any measure of intelligence can possibly have. 
Abilities are merely these mental products sorted into different classes or 
types of operation. Thus, the class of operations which consists of 
effectually associating one fact with another and recalling either or 
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both at an appropriate time is called learning; that of drawing inferences 
or educing relations between them, reasoning ability; that of merely 
retaining them, memory. The older psychologists were inclined to use 
a relatively small number of such classes based primarily on the type of 
mental process supposedly involved. More recently psychologists have 
begun to emphasize not only the processes but the content as well. 
They speak not only of memory but of auditory memory; not only of 
reasoning but of abstract, verbal or arithmetical reasoning. In a like 
manner some psychologists have begun to distinguish various kinds of 
intelligence. Thorndike, for example, has suggested subdividing in- 
telligence into three main types: (1) abstract or verbal intelligence,' 
involving facility in the use of symbols; (2) practical intelligence, in- 
volving facility in manipulating objects; (3) social intelligence, involving 
facility in dealing with human beings. The significant thing about this 
classification is that it emphasizes what a person can do as well as how 
he can do it. This distinction between function and content is fully 
justified by experimental evidence. The rating which an individual 
attains on an intelligence examination depends to a considerable degree 
on the type of test used. His score on a test made up largely of verbal 
items may differ significantly from that obtained on a test involving 
questions of social comprehension and still more from another test made 
up of items involving predominantly psychomotor reactions and the 
perception of spacial relationships. 

Though test results show that the rating which an individual attains 
will frequently depend upon the type of intelligence test used, they also 
show a contrary tendency. When large numbers of individuals are 
examined with a variety of intelligence tests, those who make high 
scores on any one of them tend to make high scores on the remaining 
ones; and the same holds for those who make low and intermediate 
scores. This dual characteristic of human abilities — their specificity 
on the one hand and interdependence on the other — ^has long been ap- 
preciated by psychologists. But unfortunately, the reaction to this 
observation was not to accept it as a fact, but rather as a logical dilemma 
from which one had to escape. The older writers tried to escape it by 
accepting the scholastic formulated faculties; the modern ones, by their 
theory of independent unit or group traits. But more than 30 years 
ago Professor Carl Spearman put an end to the dilemma by showing, 
through rigorous mathematical proof, that all intellectual abilities coul(\ 
be expressed as functions of two factors, one a general or intellectual 
factor common to every ability, and another a spedj^c factor, specific 
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to any particular ability and “in every case different from that of all 
others”. This proof first appeared as a brief article in the American 
Journal of Psychology (1904). It has since been subjected to a great 
amount of discussion, criticism and experimental investigation. We 
cannot enter into all this here, but can only indicate our own position 
by saying that Professor Spearman’s generalized proof of the two-factor 
theory of human abilities constitutes one of the great discoveries of 
psychology. 

As has often been the case in the history of science, the proof of the 
two-factor theory, in addition to being a discovery, was also an explicit 
foivnulation of an hypothesis which workers in the field had unknow- 
ingly been assuming for some time. The fact is, that from the day 
psychologists began to use a series of tests for measuring intelligence, 
they necessarily assumed the existence of a general or common factor. 
This becomes immediately apparent if one recalls what the actual 
contents of intelligence tests are. They consist of various intellectual 
tasks which we call tests that require the subject to do such things as 
to define words, reproduce facts from memory, solve problems in 
arithmetic and recognize likenesses and differences. The variety of 
taslcs used, their difficulty and manner of presentation varies with the 
type of scale employed. But so far as measuring intelligence is con- 
cerned, these specific tasks are only a means to an end. Their object is 
not to test a person’s memory, judgment or reasoning ability, but to 
measure something which it is hoped will emerge from the sum total of 
the subject’s performance, namely, his general intelligence. One of the 
Neatest contributions of Biiiet was his intuitive assumption that in the 
selection of tests, it made little difference what sort of tasks you used, 
provided that in some way it was a measure of the child’s general in- 
telligence. This explains in part the large variety of tasks employed 
in the original Binet scale. It also accounts for the fact why certain 
typos of items which were found useful at one age level, were not neces- 
sarily employed at other age levels. More important than cither of these 
details is the fact that for all practical purposes, the combining of a 
variety of tests into a single measure of intelligence, ipso facto, pre- 
supposes a certain functional unity or equivalence between them. 

The functional equivalence of the test items, an assumption implicit 
not only in the Binet Scale but in any scale which is composed of a 
variety or pool of intellectual tasks, is absolutely necessary for the 
validation of the arithmetic employed in arriving at a final measure of 
intelligence. This arithmetic consists, first, of assigning some numerical 
value to every correct response; secondly, of adding the partial credits 
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so obtained into a simple sum; and, thirdly, of treating equal sums as 
equivalent, irrespective of the nature of the test items which contribute 
to the total. For example, every test passed on the Stanford-Binet 
(between ages 3 and 10) contributes two months to the mental age 
(M.A.) score of the subject, irrespective as to w^hether the test passed 
calls for a repetition of a series of digits, the copying of a square, the 
definition of a word, or the correct reply to a common sense question. 
To all intents and purposes, therefore, the simple addition of these 
groups necessarily assumes an arithmetical equivalence of the test 
items so combined. If the different tests were taken to represent 
generically different entities, one could no more add the values assigned 
to them in order to obtain an M.A. total than one could add 2 dogs, 
3 cats and 4 elephants, and expect the unqualified answer of 9. That, 
of course, does not mean that their addition is impossible. If instead of 
being concerned with the characteristics of the dog, the cat and the 
elephant, which differentiate them from one another, we restrict our 
interest to those which they all have in common, we can say that 2 
dogs, 3 cats and 4 elephants make 9 animals. The reason we can get 
an answer of 9 here is because dogs, cats and elephants are in fact all 
animals. The addition would no longer be possible if for cats we were 
to substitute turnips. 

The same principle is involved when we attempt to add up the num- 
ber of tests correctly passed on an intelligence scale into a simple sum. 
The reason we can add together scores obtained from tests requiring 
such seemingly different abilities as those involved in solving arithmetic 
problems, repeating digits and defining words, is because they are in 
fact alike in certain ways. They are alike in that they are all measures 
of general intelligence. This means that all must have a common 
characteristic, or to use the current psychological term, a common fac- 
tor. Prof essqr^^Speaiman .has shown that such a common factor not 
only has to^ be assumed in any attempt to measure general intelligence 
by means of tests, but has demonstrated that its presence can always 
be revealed through appropriate statistical procedures. The common 
factor turns out to be a recurrent mathematical quantity which can be 
‘‘extricated"' from the tests by special correlational methods, — a quan- 
tity which he has called Just what '‘g” is psychologically and 

to what extent it may be identified with general intelligence, are still 
matters of speculation and dispute. As will be seen shortly, the pres- 
ent writer is far from being in full agreement either with Professor 
Spearman's concept of general intelligence or even with his views re- 
garding the best mode of measuring it, but as regar^ the ^monstration 
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of the existence of as a common factor, there seems to be no pos- 
Sbillty of doubt. Psychometrics, without it, loses its basic prop. 

A few words as to the nature of First and foremost, it is a 

purely mathematical quantity, ‘'originally intended to explain correla- 
tions that exist between the most diverse sorts of cognitive performance’', 
which recurs in all data obtainable from measures of intellectual ability. 
In this respect it may be said to be similar, or at least comparable, to 
some of the constants met with in physical and more particularly 
atomic measurements. But “g" is also more specific than that. It is 
evidently a kind of something that must be posited to explain the effects 
of mental work or the operations of the mind. It is, therefore, a kind 
of energy, or more correctly, a measure of the same. This is the second 
and^' more ~ general interpretation of “g”. Combining these two con- 
ceptions, we may say that “g” is a psychomathematical quantity which 
measures the mind’s capacity to do intellectual work. 

Everybody will agree that the capacity to do intellectual work is a 
necessary and important sign of general intelligence. The question is 
whether it is the only important or paramount factor. In this writer’s 
opinion it is not. Professor Spearman seemingly t hi n k s it is, although 
on this point he has failed to declare himself unequivocally. On the 
one hand, he states, “Such a factor as this [“g”] can scarcely be given 
the title of intelligence at all.” But after having said this, he devotes 
several chapters^ in an attempt to prove that the best tests of intelligence 
are precisely those which contain the largest amounts of “g”. If this is 
so, then for all practical purposes, “g” and general intelligence may be 
said to be equivalent. This equivalence, indeed, is implied by the 
mathematical relationship of the “g” and “s” factors in the two-factor 
theory. According to this relationship an intelligence scale made up 
of a large number of tests especially rich in “g” would in the end be a 
measure of “g” exclusively.^ In the writer’s opinion, such a scale 
would not be a very good measure of general inteUigence because it 
would eliminate a number of abilities essential for effective behavior. 

The view that other salient factors besides “g” enter into measures 
of intelligence is based on several sources of evidence. The first is 
clinical. We know from experience that individuals attaining identical 
scores on intelligence tests cannot always be classified in the same way. 
This is perhaps most obvious in cases where test results call for prac- 
tical action, as for example when they are used as a basis for deciding 

» Spearman, C.: The Abilities of Man. New York, 1927. 

* For, by pooling such tests, the “g'* factor (being common) becomes cumula- 
tive, whereas the sjiecific factors (being incidental) tend to cancel each other. 
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whether or not a subject should be committed to an institution for 
mental defectives. In such cases the test results, e.g., a Binet I.Q., 
cannot be used as the sole criterion. One child with an I.Q. of 75 may 
be definitely defective while another with an identical I.Q., or indeed 
one 5 or 10 points less, be far from so classifiable. Of course, the 
objection may be made that the classification of mental deficiency is 
in part a social diagnosis. But is not the capacity for social adapta- 
tion also a sign of intelligence? Should not the capacity to avoid 
mischief and the ability to work persistently at a task, enter into 
one^s definition of general intelligence just as well as the ability to de- 
fine words, and perceive analogies? The clinician’s answer has always 
been ^'yes”, and by so saying, he has implicitly assumed that there are 
other factors besides the intellective ^^g” which enter into intelligent 
behavior. Hitherto he has been unable to demonstrate their existence 
experimentally. In recent years, however, thanks to new correlational 
techniques, especially the method of factorial analysis as developed by 
Professor Thurstone in this country, a beginning has been made. Of 
particular importance has been the work of W. P. Alexander whose 
monograph on IntelUgerice, Concrete and Ahsiraat^ is in many ways 
fundamental. 

Alexander set himself the problem of testing experimentally the 
evidence for and against the main theories now currently favored m 
psychological circles. The first of these is Professor Spearman’s two- 
factor theory which we have already referred to. The other is the 
unique traits theory, according to which intelligence involves several 
abilities or factors, each ind^endent of one another. More specifically, 
his investigation took the form of an experimental study to determine 
whether test results supported the view that “practical” and “verbal” 
intelligence were each distinct and independent capacities, or the view 
of Spearman according to which both were essentially the same in that 
they were not independent capacities but only differed with respect to 
their non-intellective or specific factors. 

Alexander’s findings were extremely interesting. They confirmed 
Spearm^’s maia contentiomthat.there w^^ one and only one common 
factor in all measures of intelligence and, at the same time, showed that 
this factor alone is not suflicient to explain the total correlational vari- 
ance which existed between the tests used to measure intelligence. In 
addition to the common factor there are seemingly other broad factors 
which, while not showing the same generality, are nevertheless recur- 
rent in a significant number of abilities which form subgroups or “com- 

^ Brit, /. PyschoL, 1935. 
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munal cliisters’\ The individual tests by which these abilities are 
measured contain a common factor of their own with respect to which 
they function in much the same way. Alexander has termed abilities 
'involved in tests showing such similarity of function functional unities. 
Thus, verbal ability is one functional unity, practical ability another, 
and so on. But while each of these functional unities requires a sepa- 
rate factor to take care of its respective contribution to any global 
measure of intelligence, they are nevertheless '"definitely related”, that 
is, correlated with one another.® This means that they cannot be uni- 
tary traits in the sense implied by the unique traits theory. On the 
other hand, neither can they be considered as specific factors in the 
sense required by Spearman^s two-factor theory. For, these factors, 
unlike the "s” factors, actually contribute a considerable amount 
to the correlation variance of the test composites of which they form 
a part. 

Another important conclusion suggested by Alexander's investiga- 
tion was that in order to account for the complete intercorrelation 
variance found among any large battery of intelligence tests one has to 
posit other factors in addition to the purely intellectual ones. After 
eliminating the general factor ("g”), and sqch other factors® as were 
contributed by the above described “functional unities”, Alexander 
found that a considerable amount of his total intercorrelational variance 
was still unaccounted for. In addition to these factors there were ap- 
parently certain other supplementary global ones which, though not 
directly measurable, nevertheless contributed significant amounts , to 
the total variance of the observed data. These latter factors he has 
provisionally labeled “X” and “Z”. They cover such items as the 
subjects ,interest in doing the tasks set, his persistence in attacking 
them and his zest and desire to succeed, — ^items which might more 
familiarly be described as temperamental or personality factors, but 
which nevertheless must be recognized as important in all actual 
measures of intelligence. For this reason, one might appropriately 
refer to them as the non intellective-factors or, more specifically, as the 
non^intellective factors in general intelligence.’ 

* Thus verbal ability correlates with practical ability to the extent of .50. 

•These were primarily the factors 'V”, common to tests involving verbal 
ability, and “P* common to tests purporting to measure practical ability. 

^ For further evidence as to the existence of these factors, see D. Wechsler: The 
non-in tellective factors in general intelligence. J, Abnorm, and Soc, Psychol,, 

i94ClirriD9=ro4. 
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It thus appears that the entity or quantity which we are able to 
measure by intelligence tests is not a simple quantity. Certainly it is 
not something which can be expressed by one single factor alone, say 
“g”, whether 3^ou define it in its most general terms as mental energy, 
the ability to educe relations or merely as the intellective factor. In- 
telligence is all this and 3"et something more. It is the ability to utilize 
this energ;^’' or to exercise this ability in contextual situations, — situa- 
tions that have content and purpose as well as form and meaning. To 
concede as much is to admit that any practical definition of intelli- 
gence must be fundamentally a biological one in the wddest sense of the 
term. That has been the hypothesis assumed in the construction of the 
intelligence scales that are to be described in the ensuing pages. We 
think that they measure general intelligence in the sense defined above. 
We shall not, however, claim that they measure all that goes to make 
up general intelligence, because no tests at present are capable of doing 
it. The only thing we can ask of an intelligence scale is that it measures 
sufficient portions of intelligence to enable us to use it as a fairly re- 
liable ipdex of the individuabs global capacity. 

It is unportant to realize that intelligence tests do not and cannot be 
expected to measure all of intelligence, but it is of equal importance to 
emphasize that they measure a great deal more than the delimited 
capacities to which contemporaneous theory seems desirous of restricting 
them. Intelligence tests measure more than mere learning ability or 
reasoning ability or even general intellectual ability; in addition, they 
ine'^tably measure a number of other capacities which cannot be defined 
as either purely cognitive or intellective, — abilities heavily loaded with 
factors like “X” and ‘‘Z”, the non-intellective factors mentioned above. 
Hitherto, authors of intelligence scales when recognizing this situation, 
looked upon these factors as disturbing elements and tried as far as 
possible to eliminate them. Unfortunately, experience has shown that 
the more successful one is in excluding these factors, the less effective 
are the resulting tests as measures of general intelligence. What are 
needed are not tests from which the non-intellective factors have been 
eliminated (even if that were possible), but, on the contrary, tests in 
which these factors are clearly present and objectively appraisable. 
The performance tests described in Part II of this volume are an at- 
tempt in this direction. They are only a beginning; but the results 
already obtained show that tests of verbal ability, abstract reasoning 
and the like when used alone in a general intelligence examination give 
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only an incomplete picture of the individual’s capacity for effective 
adjustment and achievement. 

We have concluded above that intelligence tests cannot measure all 
of intelligence. We must further qualify this conclusion by the state- 
ment that at different ages they measure different portions of it. In 
general it might be stated that as the individual grows older our tests 
become less and less an effective measure of his global intelligence. The 
failure to realize this has led to a great deal of confusion in interpreting 
test results, particularly when used for the evaluation of adult intelli- 
gence. The difficulty is most acutely apparent when intelligence levels 
are given in the now almost universal notation of mental age. Most 
psychologists are aware of the fact that when an adult of 30 scores a 
mental age of 12, and a child of 12 scores a mental age of 12, their 
intelligence is not identical, yet there does not seem to be any general 
imderstanding as to why they are not identical. If what we have 
pflirl above is correct, the answer is not far to find. The basic reason a 
mental age of 12 at 12 does not mean the same thing as a mental age 
of 12 at 30, is that the measured abilities and hence the M.A. scores 
represent different portions of the subjects’ respective total intelligence. 
At age 12 the tests axe capable of tapping far more of the individual’s 
capacities than at 30.® How much more we cannot say without further 
investigation. But even if the differences were small it would challenge 
our entire present method of defining at least adult intelligence in terms 
of M.A. levels. The same observation may be made as regards intelli- 
gence quotients with even greater force, because as we shall have occa- 
sion to see, the I.Q. concept as a mental age score divided by a 
chronologic^ age score, presupposes a constancy of relationship between 
the two, which in point of fact does not exist. 

Sufficient has been said to show that the definition of general intelli- 
gence, far from being a mere theoretical question, is one which enters 
immediately into any practical attempt at measurement. It is par- 
ticularly important when we come to measuring adult intelligence, 
because we are at once confronted with a wider range of criteria against 
which our definition may be checked. But before this checking can be 
done, the tests themselves must of course have been available. The 
scales presented in the following pages are an attempt to supply this. 
The extent of the present need for them will form the content of our 
next chapter. 

It should be added that the same might be said of age 6 as compared to age 12. 



CHAPTER 2 

NEED FOR AN ADULT INTELLIGENCE TEST 


Although the earliest investigations in the field of psychometrics were 
made largely with adult subjects, the great bulk of the test data which 
now forms the basis of intelligence scales, has been derived from the 
examination of school children. The reasons for this are several. Per- 
haps the most important is the relative ease with which one may obtain 
young subjects; children are nearly always available through the schools. 
Another is the fact that it is generally much easier to devise children’s 
tests than adults’ tests, both from the point of view of the definition of 
ability measured, as well as the likelihood of interest and appeal. 
Finally, there is the fact that the results obtained by the testing of 
children lend themselves more readily to concrete applications. 

Demonstration of the value of intelligence tests for children has come 
from two important fields. The first of these is the school. From the 
very beginning it was apparent that intelligence tests, particularly of 
the Binet type, correlated very highly wdth scholastic achievement. A 
well trained psychologist after an hour of standardized testing could 
often more accurately differentiate between the superior, the average, 
and the dull than the teacher who may have had the children under 
daily observation for a period of several months. Here, then, was an 
instrument that could be of great value in the matter of proper grading 
of school children, particularly at the outset of their training. Naturally 
psychologists and educators who were at all interested in tests, devoted 
a large part of their energies to the development of these important 
possibilities. 

The other field in which intelligence tests early demonstrated their 
usefulness was in the diagnosis of mental deficiency. Tests supplied 
for the first time a quantitative measure of the degree of mental defect. 
And here the tests seemed not only applicable to the diagnosis of re- 
tardation among children, but for the classification of defective adults 
as well. The fact that the tests were originally standardized on children, 
was looked upon by those who applied them as no great limitation upon 
their usefulness. Indeed, because the tests made possible a comparison 
of adult functioning with that of children, they offered the clinician a 
new basis for classifying degrees of deficiency. We refer, of course, to 
the definition of degrees of deficiency in terms of mental age levels. 

13 
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For the first ten years after their publication, the Binet tests and those 
of their type, were primarily used in the examination of children or 
defectives. Binet, himself, seemingly intended them to be used only 
with children since the scale was originally standardized only through 
the age of J,5. The possibility of using intelligence tests with adults, 
however, was immediately apparent, and a number of efforts were made 
in this direction. The earliest of these confined themselves largely to 
the testing of special groups of adults, such as inmates of prisons or 
mental hospitals. But it was really not until 1917 when the U. S. Army 
enlisted the help of psychologists for the purpose of classifying the 
soldier draft, that the intelligence tests were tried out on a large scale 
on normal adults. In the meantime, Terman had brought out the 
Stanford Revision of the Binet Tests, the extension of which included 
tests at adult levels. Unfortunately, the urgency of examining large 
numbers of individuals in a relatively short time made it impossible to 
use this scale to any considerable degree. To meet the practical need 
for group examinations, the psychologists assigned to the task devised 
the now justly famous Army Alpha and Beta , group tests. Given as 
they were to more than a million men, they constituted the most thor- 
oughly standardized group-tests yet devised. The Alpha test, in the 
opinion of the writer, still remains the best instrument of this type. 
But its merit is primarily to enable the examiner to make a rapid exam- 
ination and subsequent classification of a large munber of subjects in a 
short time. It does not fmrnish means for an individual diagnosis.^ 

Following World War I a number of new adult group tests, mostly 
modeled after the Army Alpha, were devised. Little was done, how- 
ever, to restandardize the individual intelligence examinations although 
one of the most outstanding results of the army experience with those 
available was their general inadequacy for use wdth adults. In the case 
of the Stanford-Binet, a special study on some 400 unselected subjects 
revealed that the mental age adult norm was some three years lower 
thav that given by Terman in his book. Thus the average mental age 
of the adults® examined in the army turn out to be 13.08 years instead of 
16 years as originally claimed. Approximately the same results were 
obtained with performance tests on the same group of individuals. 

I Diagnosis of individual cases— such as of soldiers who were referred for 
discharge or to development battalions — was nearly always made on the basis of 
one or another of the individual examinations then available, namely, the Stan- 
ford-Binet, Yerkes Point Seale, Army Performance, etc. 

* Males a^s 21 to 30. 
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In view of the fact that the army data were derived from a much better 
statistical sample of adult population than that of the original Stanford- 
Binet standardization, one might have supposed that the new norms 
made available by them would have replaced the old ones; but this was 
not the case. Psychologists have from time to time protested against 
using the old norms. In some instances they have gone to the extent of 
discarding 16 years and substituting 14 or 15 years, as the denominator 
for calculating adult I.Q.^s. In other instances, they have reinterpreted 
the significance of the I.Q. when calculated on the original basis, as for 
example, by considering an I.Q, of 88 rather than 100 as average for the 
adult population. But such has been the wude acceptance of the Stan- 
ford-Binet norms, that in spite of the cuixAilative evidence as regards 
their inaccuracy when applied to the measurement of adult intelligence, 
their use for this purpose has ^o^jitinued without much abatement. It is 
true that within the last ye4r Terman and MerrilP have published a 
thorough revision of the Stanford-Binet in which 15 years is used as the 
average adult mental age. Unfortunately, these tests were again not 
standardized on adults, that is, on individuals over 18 years. The 
new Terman-Merrill like the old Stanford-Binet still leaves unanswered 
the question of the “true’^ mental age of the average adult. 

If we inquire why the Binet and other children's scales have continued 
to be used for the testing of adults, in spite of the criticism that has been 
leveled against this practice, the answer is not far to find. No better 
instruments were on hand. Up to very recently there has been no 
suitable individual adult examination available. Moreover the chil- 
dren's tests did meet a need, hovrever imperfectly. Finally, and this is 
of peculiar interest, the continued employment of these tests instead of 
making their users more alert to the tests’ inadequacies seemingly made 
them more obtuse to their shortcoming. No doubt the great difficulties 
which the standardization of a new adult scale presented, also had 
something to do with this feeling. But whatever the reasons, the 
continued use of children’s scales for adult testing has no scientific 
justification. The scales now in use fail to meet some of the most 
elementary requirements which psychologists ordinarily set themselves 
when standardizing a test. The first deficiency is that they have not 
been standardized on a sufficient number of cases. Indeed most of 
them were never standardized on any adults at all. To see how serious 
this situation is, we list below the individual tests most commonly used 
in this country and the number of adults on whom the needed norms 

• Terman and Merrill: Measuring Intelligence, Boston. 1937. 
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were ostensibly computed when the tests were first standardized. (See 
Table 1.) 

It thus appears that udth the possible exception of the Army Per- 
tamance none of the currently used scales meets the very first prerequi- 
site of a valid intelligence examination, namely the condition of having 
been standardized on a sufficiently large number of cases. The serious- 
ness of this failing is shown by the fact that no published studies with 
these tests have confirmed the norms which the authors posited for 
them. Most of these studies, it is true, were again made with relatively 
small numbers of subjects, but in every one involving populations of 
any considerable size, as for example the population sample of the sol- 
diers of the U. S. Army, used to obtain M.A. equivalents for the Alpha 
and Beta tests, the results showed that the original Stanford-Binet 


TABLE 1 


TEST 

NtTMBBB or ADm/rS 


62* 


Not specified 


73 



0 


260 

'T'r.wrvwfti-k ■RdTftairvn rif Svf.nnfnrfi-Rinftt, 

0 

J. Ci 111 XvCVlOlwl-l V/l vllC kJi/arU.iwx VI 



• Terman speaks of the tests having been given to 150 migratory workers but 
the scores obtained from these subjects do not seem to have been used in the final 
standardization. See Terman: The MeaeuTement of Intelligence. Boston, 
1916. Pp. 54r-55. 

norms needed immediate discarding. According to these norms no 
less than 34 per cent of the “flower’^ of American manhood would have 
had to be classified as mental defectives.-* Such a number is palpably 
absurd, but perhaps no more astounding than some of the earlier esti- 
mates of the incidence of mental deficiency among delinquents, which 
ran as high as 90 per cent.® 

A more obvious shortcoming of intelligence examinations now used 
for adults, and one that has often been noted, is the unsuitability *of 
much of the material that forms part of the examinationsr Many of the 
test items do not seem to be of the sort that would either interest or 

Using 16 years as an adult denominator and an I.Q. of below 75 as indicative 
of mental deficiency. The figure 34 per cent is derived from the data furnished 
in table 54, Armg MeTmirs, p. 391. This is for English-speaking Whites. For 
the colored population, Southern drafte, the per cent of defectives by the same 
criterion is almost doubled, 

* Pintner, R.: Intelligence Testing, New York, 1931, 2nd ed., pp. 375-377. 
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appeal to an adult. The early lay critics of intelligence tests empha- 
sized this fact particularly. Although their criticism was often exag- 
gerated, it contained more than a grain of truth. To ask the average 
adult to say as many words as he can think of in three minutes, or to 
make a sentence of the words to asked paper my teacher correct I my, 
and assume that he will be either interested or impressed, is expecting 
too much. The average child generally responds to such questions as a 
matter of course. The average adult will, as often as not, start wonder- 
ing as to what possible purpose or meaning the tests can have. Such 
remarks as, “That's baby stuff," “Why do I have to do this?" and “I 
never had that in school," are very common, particularly among the 
less alert subjects. Asking the ordinary housewife to furnish you with, a 
rhyme to the words, “day", “cat", and “mill", or an ex-army sergeant 
to give you a sentence with the words, “boy", “river", “ball", is not 
particularly apt to evoke either interest or respect. 

Apart from the matter of interest and appeal, there are other serious 
objections to the type of material generally employed in children's tests 
which make them unsuitable for adult use. One of them is the fact that 
credit for correctness of response so often depends upon the individual's 
capacity to manipulate words (or objects), rather than upon compre- 
hension of their meaning. Thus, a subject asked to make a sentence 
of the words, for the started an we country early at hour might think 
that the main idea to be apprehended, and therefore expressed, is that 
somebody started somewhere at an early hour. Nevertheless, “We 
started early at an hour for the country", is marked as a failure.® The 
subject must say, (to be credited with a correct response), “We started 
for the country at an early hour", or at least, “At an early hour we 
started for the country". Unfortunately, many items on current 
scales call for an accuracy of precisely this sort. 

Another limitation to the use of tets on adults, originally standard- 
ized on children, is that many of these tests lay altogethei\^o much 
emphasis on speed as compared to accuracy.*^ This does not imply 
that speed of response is not important in measuring intfe^lligence. 
Actually there is a high correlation between speed and accurac y. In- 
deed, in the case of children at least, it would seem that of the tw^p, speed 
alone is often the better measure in the sense that it correlates higher 
with other global ratings of intelligence. In the case of adults, however, 

• Cf, Terman, L.: The Measurement of Intelligence^ p. 218. 

^ This emphasis, to be sure, varies widely in different scales. In the JBinet, 
for example, speed of response is of secondary importance, but in most perform- 
ance tests it is given considerable weight. 
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general experience would indicate that speed ^lone is not the better 
measure. This is particularly true for older subjects who do badly on 
nearly all “speed’’ tests. A number of explanations have been offered 
for this fact. The one which is of particular pertinence to our discus- 
sion is the possible influence of the different attitude which adults take 
toward set tasks or test situations. 

When you tell a childj “Put these blocks together as fast as you can/’ 
the chances are he will accept the instructions at their face value. One 
cannot be so sure of a similar acceptance in the case of the adult. He 
might be a type of individual characterizing the attitude, “look before 
you leap”, or “first try to figure the thing out”. In that case his atti- 
tude might only serve to get him a lower intelligence rating. On many 
performance tests the difference of a few seconds in the time taken to 
complete a set task often reduces a subject’s score considerably.® Facts 
like these do not, of course, imply that time scores cannot be used in 
testing adult intelligence, but they do show that their evaluation and 
interpretation may be considerably different from that which we place 
upon them in the case of children. 

The foregoing considerations illustrate some of the more important 
reasons why tests standardized on children cannot be used on adults. 
We have yet to mention the most fundamental of all; the fact that adult 
intelligence cannot be evaluated in the same terms as those generally 
employed in defining juvenile intelligence. We refer to the definition 
of intelligence in terms of mental age (MA.) levels and mental age 
scores. The concept of mental age, fundamental as it is to the defini- 
tion of juvenile intelligence, may be grossly misleading w-hen applied 
to the definition of adult mental capacity. A mental age score of 12 
years, for example, in a child of twelve does not represent the same level 
of mental ability that it would in the case of a man of forty. Even 
more misleading is the use of the M.A, as a basis for calculating adults’ 
indices of iDrightness (I.Q.’s) by methods identical to those employed 
in the c^se of children. The continued use of such methods by psychol- 
ogists isf^due in part to the fact that children’s tests have continued to 
be theii main instruments for measuring adult intelligence, and in part 
to the fact that the significance and implications of the M.A. and LQ. 
are still not fully imderstood by many. In view of the important r6Ies 
which ^the M.A. and LQ, play in all contemporaneous attempts at 
evaluating and measuring intelligence, the criticisms we have just made 
are obviously very serious. It is important to discover whether they 
are justified and, if so, to see how they can be remedied. 

* For example, completing the Seguin Form Board in 10 seconds earns the 
subject an MA. score of 15 years; in 13 seconds, an M A. score of only 13 years. 



CHAPTER 3 


THE CONCEPTS OF MENTAL AGE AND 
INTELLIGENCE QUOTIENT 

The term mental age, as now used in psychology, was first coined by 
Binet, who offered it as a way of defining different degrees or levels of 
intelligence. The novel point was that he proposed to define these 
levels in terms of the measured abilities of children at different ages. 
This presupposed that intellectual ability could be measured and that 
it increased progressively with age. Both of these assumptions have 
proved correct! Binet^s great contributions, however, were more 
specific. (1) He devised a series of graded intellectual tasks whereby 
intelligence could in fact be effectively measured; (2) he described a 
mode of evaluating the results in terms of age units such that the aver- 
age child of 6 might be said to have a mental age of 6, the average child 
of 9 a mental age of 9 years, and so on. The technique of scoring tests 
in terms of age units has come to be known as the mental age method, 
and the scores obtained by this method as mental ages (M.A.^s). 

The method by which an age-intelligence-scale is devised is briefly as 
follows: A series of intellectual tasks of varying difficulty is assembled 
and administered to subjects of different age groups. The responses are 
scored and collated, and, on the basis of the per cent of individuals 
passing and failing the various tasks at different ages, certain of them 
are selected as suitable tests. The tests selected are then ordered 
according to difficulty and combined into groups usually of six or eight 
to form various year levels. The number of tests per year level deter- 
mines how many credits are assigned to each test. For example, if 
there are six tests per year, each test passed counts two months. The 
final score or M.A. which an individual gets on the tests is the sum of the 
partial credits he obtained for the tests passed at different year levels, 
expressed in months and years, plus a certain bonus for tests which it is 
assumed he could have passed if the tests had been given him.^ The 
sum of both, expressed in months and years, is the individuaFs men- 

1 The bonus consists of the M.A. score automatically credited for items below 
the year level on which the subject has passed all tests. For example, if a child 
passes all tests at year IX (known as the basal year) he is given full credit (96 
months) for all tests through year VIII, even though he has not actually taken 
the tests of the lower year levels. 
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tal age score. Thus if a child passes 6 tests at year IX, 4 tests at year 
X and 3 tests at year XII, his M.A. score is 12 + 8 + 6 + 96 (bonus) 
or 122 months. All this is, of course, quite familiar; but we have 
thought it worth while to summarize the actual procedure for obtain- 
ing a mental age in order to throw into focus a number of fundamental 
facts of which even psychologists sometimes lose sight. 

The first of these facts is that a menial age, however obtained, is just a 
score. Basically it differs in no way from any other type of score given 
in terms of the number of items passed out of a possible total. Thus 
when a child gets a mental age score of 122 months on the Binet Scale, 
the important fact is that he is credited with having passed 61 test 
items.^ The fact that we multiply each item by 2 so as to be able to 
express the score in temos of months and years is primarily a matter of 
convenience. An intelligence rating expressed as a score of 61 points 
is as real and can be made as understandable as a mental age score of 
122 months. It has the same arithmetical properties and the same 
possibilities of evaluation, including that of calculating intelligence 
quotients. Of course, it also has the same limitations. 

The second point of importance about the M A. method of evaluating 
intelligence is that it inevitably limits the range of possible scores. 
Beyond certain points M.A. equivalents are impossible. These limits 
are reached, for any given test, whenever the mean scores made on the 
test cease to increase with advancing chronological age. The limiting 
mental age varies from test to test. Thus on the Manikin Test the 
mean scores cease to increase above age 8; on the Ship Test, above 12; in 
the case of Memory Span for Digits, they stop increasing at 14 and in 
the case of the Vocabulary Test, at about age 22. 

The point at which mean scores cease to increase with advancing age 
is in part dependent upon the difficulty of iffie test used and in part a 
function of the general maturation process. Thus in the case of the 
Manikin and Ship Tests the mean scores fail to increase with advancing 
age because the tests are too easy. In the case of the Memory Span for 
Digits and the Vocabulary Test the differences between the mean 
scores at higher ages disappear because the abilities measured by these 
tests no longer increase with age. Thus the ability to repeat digits 
stqgsTmprovmg^t not because it is impossible to attain a higher 

score than those generally attained by the average 14 year old, but 

* Actually he will have been tested with considerably fewer items. This 
number 61 includes both the items actually passed as well as those for which he 
received automatic credit. See previoxia note. 
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because the mean scores for the average 16, 18, and 20 year old are no 
higher. 

What is true of the various abilities considered individually is equally 
true of measures of these abilities when combined into batteries^' of 
tests to yield measures of general intelligence. Beyond the age of 
15, mean scores on practically alP intelligence scales cease to increase 
significantly with age. Psychologists have generally interpreted this 
fact to mean that intellectual ability stops growing at about that age. 
With this general conclusion we are in agreement, but shall not pursue 
it any further at this time.'* Of more immediate concern to us here 
is what Bearing the fact that all intelligence scales eventually reach such 
a point may have on the concept of mental age. The first implication 
is obvious: the mental age concept has a natural limit of applicability. 
When a test reaches a point beyond which mean scores on it cease to 
increase with age, then any higher scores for which the test allows can 
no longer be expressed in terms of mental age. 

The fact that every intelligence scale attains a point beyond which 
mean scores for successive age groups no longer increase with age, does 
not mean, of course, that scores higher than those calculated at the 
limiting age levels, cannot be attained. On the contrary, the fact that 
the mental age scores are average scores, shows that there must be a 
large percentage of individuals who attain higher scores than the mean. 
The only question is how to interpret, or at least make use of these. One 
way is to assign hypothetical values or M.A. equivalents to them, based 
on the relative frequency of their occurrence; another is to accept them 
at their face value and to assume that if there were higher mental age 
scores they would increase with chronological ^e precisely in the manner 
in which the scale provides for it. In either case, we get what are 
obviously extrapolated values, that is, M.A.’s which are only hypo- 
thetically related to the actual data, and whose maximal values are 
limited only by the range of the test scores. Thus, in the case of the 
Stanford Revision of the Binet it is possible to obtain an M.A. of 19 
years and 6 months, on the Otis Tests of Mental Ability an M.A. of 18 
years and 6 months, and on the Terman-Merrill Revision of the Binet an 
M.A. of 22 years and 10 months. With these limits, we are not par- 
ticularly concerned, nor with the doubtful statistical procedures by 

* This is not true of intelligence scales like the Otis Self-Administering Tests 
of Mental Ability, which are markedly influenced by educational factors. 

^See Wechsler, David: The Range of Human Capacities, Baltimore, 1935, 
Chapter 8. 
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which they have sometimes been obtained. But it is very important 
to appraise their possible psychological significance. 

A mental age score above an age beyond which mean scores increase 
with age, e.g., an M.A. of 20 years, can have only one of two meanings. 
The first and most important one is that which it could have had, if it 
signified the same thing as that which is implied when we say that a 
child has a mental age of 7 or 8 or 10, namely, that it represents the 
average mentality of the average individual of that age, expressed in 
^onths and years. Such an interpretation for a mental age of 20 
years is clearly incorrect. The average mental age of the average 20 
year old is not 20 but 15 years. The second possible meaning of mental 
age of 20, is that it represents a measurable level of intelligence that is 
above the average, the precise amount of which for the sake of conven- 
ience, is expressed in the year-month notation. In that case, however, 
the above notation acquires an altered connotation and can only add 
confusion to the original concept. To speak of an individual having an 
M.A. of 20 years is both practically and scientifically meaningless. 

What we have said thus far does not, of course, deny the value of the 
mental age concept altogether, but only points to its inevitable limita- 
tions. The most important of these limitations, as we have just seen, 
is that the M.A. method of defining intelligence cannot logically be used 
to define levels of intelligence higher than that obtained by that age 
group beyond which M.A. scores cease to increase with chronological 
age. The precise age at which this occurs is still in dispute. It cannot, 
in fact, be definitely fixed because the mental age limit attained is a 
function of the actual tests used, and there can be no possible agreement 
so long as different intelligence scales are composed of different batteries 
of tests. Present day evidence, however, is that the age beyond which 
M.A.^s generally cease to increase with chronological age is not far from 
15 years. In the opinion of the writer, the mean for all tests pooled 
together would probably be a little liigher.® But whether the mean 
adult M.A., as this limiting value is called, be 14 or 16, the fact is that 
the M.A. method of measuring intelligence breaks down even before 
either of the^ ages is attained. Actually, it begins to fail about age 

® In all probability the age of maximal functioning mental ability falls between 
22 and 25 years, but the psychometric differences between ages 15 and 25 are 
generally so small and influenced by so many factors that they may be disre- 
garded for all practical purposes. According to the results on the Bellevue 
Adult Scale the age of maximal mental functioning ability falls at approximately 
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12, for though the means of the actual test scores continue to increase 
with age above that age, they do so with progressively diminishing and 
ultimately negligible amounts. That they do fail is of particular 
moment in calculating indices of brightness or intelligence quotients. 
Before considering this problem any further, therefore, we shall find it 
necessary to consider the meaning of these indices. 

The most universally used of all indices of intelligence is the intelli- 
gence quotient (I.Q.). It is generally calculated by dividing a subject's 
mental age (M.A.) by his chronological age (C.A.). Thus if a child of 
10 attains an M.A. of 12, his LQ. is 1.20 or, omitting the decimal point, 
120. As thus defined, the intention of the LQ. is straightforward and 
easy enough to comprehend. But its full meaning really depends upon 
what we understand by the terms M.A. and C.A. The meaning of an 
M.A. we have already discussed at length. It is a test score, for conven- 
ience expressed in a month-year notation. But what is a C.A.? We 
do not, of course, refer to its literal definition, namely the life or chrono- 
logical age of an individual at the time he is examined, but its meaning 
as a part of the LQ, formula. Let the reader pause and try to answer 
the question for himself. If he has not given it much thought before, 
it is probable he will find the task far from easy. Nor is he likely to find 
the answer by consulting the average text-book. Most of them will 
define an LQ.; some an M.A.; none, so far as we know, a C.A. This 
omission is unfortunate, because much of the nonsense which has been 
written about the intelligence quotient may be attributed to the failure 
of its critics to understand what a C.A. really is. Nevertheless, the 
definition of the C.A. in the LQ. formula is almost glaringly obvious. 

Like the M.A., a C.A . is just a score. It is a score which the examiner 
assumes would be the score of an individual of a given age if his ability 
corresponded exactly to that of the average individual of his own (the 
subject's) life age. Thus, if a given individuaTs age is 8 years, his 
C.A. score, if he were an average 8 year old, would also be 8 years. If 
his life age were 12 years and 9 months, his C.A. score, assuming him 
to be a normal individual, ought likewise to be 12 years and 9 months, 
and so on. A well standardized scale is one where the tests are so 
arranged as to make this assumption warrantable and at least approxi- 
mately correct.® But in any case the important fact is that the C.A. 

® Actually this has been shown to be the case for the midyear points only; 
for example, that the average OJ year old child attains a mental age of OJ and 
hence an I.Q. of 100. But it has not been shown for the average child of 9 years 
and 2 months, the average child of 9 years and 9 months, etc. It is probable. 
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is merely a converted score just like the M.A. The thing that makes 
them alike, however, is not the fact that they have the word age in 
common, but the fact that they are both test scores measured in identical 
units. 

Bearing these facts in mind we may now define an intelligence quotient 
in terms of what it actually represents: An intelligence quotient is the 
ratio between a particular score which an individual gets (on a given 
intelligence test) and the score which an average individual of his life 
age may be assumed to attain on the same test, when both scores are 
expressed in the same notation (e.g., in terms of months and years). 
The usual formula 


LQ,= 


M.A. 

C-A. 


should really be stated as follows: 

j Q — attained or actual score 
Expected niear^ score for age 

The great value of the I.Q. is that it furnishes us with a method of 
defining relative intelligence. It tells us in the first instance how bright 
an individual is compared with one of his own age. But it tells us, or 
at least is intended to tell us, even more than that. The I.Q. is offered 
as an index which is independent not only of the particular score which 
an individual makes on a particular scale, but also of the particular age 
at which he happens to make it. It is thus a measure which presum- 
ably defines the relative brightness or intellectual possibilities of an 
individual more or less permanently. Under ordinary conditions an 
individuaPs I.Q. is supposed to remain the same throughout life, or at 
least throughout the age limits covered by the scale. Psychologists 
refer to this property as the constancy of the I.Q. 

The constancy of the I.Q. is the basic assumption of all scales where 
relative degrees of intelligence are defined in terms of it. It is not only 
basic, but absolutely necessaiy that I.Q.^s be independent of the age at 
which they are calculated, because unless the assumption holds, no 
permanent scheme of intelligence classification is possible If an in- 
dividual at one age attained a certain I.Q. and when examined a few 


however, that within the limits of ages 5 to 12 years the deviations from the mean 
at any given intermediate age would not be very great, and the correspondence 
of the C.A. and M.A. values may for practical purposes be assumed to hold. 
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years later another I.Q., or if a particular I.Q. meant one thing at one 
age and quite a different thing at another, the I.Q. would obviously 
have no practical significance. It is, therefore, highly important to 
ascertain whether I.Q.’s, as now calculated, do in fact remain constant. 

The facts regarding the constancy of the I.Q. are essentially of two 
kinds. The first pertains to the mean values of the I.Q. at successive 
chronological ages. In the case of most of the better standardized 
tests, it can be readily shown that, at least for the standardizing samples 
of population, the mean I.Q.’s over the middle portions of the scale, are 
regularly found to be about 100. The^fact that they generally do not 
deviate more than two or three points from this value is interpreted to 
show that the I.Q. remains constant from age to age. This interpreta- 
tion, however, goes beyond the fact. The only legitimate conclusion 
that can be drawn from them is that the I.Q.’s not far from the average 
will remain constant. It does not necessarily imply that I.Q.^s at any 
considerable distance from the mean, let us say one or two standard 
deviations from it, (for example, I.Q.’s of ^5 or 70) will also remain 
constant. That will depend not only upon the average values of the 
I.Q. at different ages, but also on their respective variabilities at these 
ages. Hitherto, it has been assumed that these variabilities were the 
same or differed by no greater amounts than might be expected from 
sampling errors. In point of fact, however, little evidence has been 
produced to test this assumption which remains distinctly controversial. 

The early standardizations of the Binet Scale contained little data 
which would enable one to evaluate the variability of the I.Q. at differ- 
ent ages. It was not until Burt’s^ revision appeared that such data 
became available. Burt himself did not actually take up the problem 
of the variability of the I.Q. (or mental ratio, as he called it) but did 
furnish data from which this variability might be calculated. One is 
able to do this because he gives for each life age, not only the mean 
M.A. score for each age, but also its standard deviation (S.D.).* Using 
these figures one may calculate what I.Q. may be expected for an in- 
dividual of any given age, whose position is any given S.D. distance 
from the mean. If all I.Q.’s were constant, not only at the mean, but 
any distance from it, all individuals deviating by the same fractional , 
standard deviation from the mean would have the same I.Q.’s. Actual 
calculation, however, shows that this is generally not the case. 

Analysis of Burt’s figures reveals that except between ages G and 10 

^ Burt, Cyril: Mental and Scholastic Tests, London. 1933. 

^ Loc. cit., p. 145. 
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the difference in variability of the LQ. is so great as to significantly alter 
its value. For example, at the distance of 2 standard deviations from 
the mean an individual at age 6 attaining that rank would get an LQ. 
of 76; at age 10, he would get an LQ. of 81; at age 14, an LQ. of 84. 

More direct evidence of variability of I.Q. with age is furnished by the 
statistics of Terman and Merrill in their new revision of the Stanford- 
Binet. Their tables® are worth examining in detail. They contain 
some very significant surprises. In the first place, even the mean 
I.Q.^s show great variability, differing by as much as 9 per cent at differ- 
ent ages; for example, from a mean LQ. of 109.9 at age 2^ to a mean 
I.Q. of 100.9 at age 14. But even more significant are the differences 
between the standard deviation of the means at different age levels. 
These differ by as much as 7.5 units, and in consequence give rise to 
significant deviations from the LQ. expectancy. Thus, the standard 
deviation for the mean LQ. at age 12 on the Revised Stanfcrd-Binet 
(Form L), is 20.0, and at age 6, only 12.5. Accordingly, depending upon 
the age at which an individual is being tested, he may obtain different 
LQ.’s even though his relative brightness remains unchanged. Thus^ 
supposing him to be an individual whose position is 2 standard devia- 
tions from the mean, he would get at age 6 an LQ. of 75 and at age 12 
an I.Q. of 60. This would imply that an I.Q. of 60 at age 12 means the 
same as an LQ. of 75 at year 6.^° 

We have just shown that LQ.’s calculated by the M.A. over the 
C.A. method do not remain constant for individuals whose ratings are 
any considerable distance above or below the average of their age group. 
It also appears that the method does not furnish constant values even 
for mean LQ.’s except at certain ages. This becomes apparent if 
instead of comparing the mean I.Q.’s at different ages, one compares the 
original test scores from which they were derived. The most effective 
way of doing this is by plotting original test scores directly against 
chronological age, without any further manipulation of the data than a 
prior transmutation of the scores into units of equal amount. We have 
done this with intelligence te^sts obtained with the Bellevue Intelligence 
Beale, and the resulting growth curve is shown in figure L 

® Terman and Merrill : Measuring Inlelligence^ pp. 36-40. 

Terman and Merrill are inclined to account for the’ large fluctuations in 
variability at certain ages as being primarily due to sampling errors or to the 
influence of pubescent changes. But though it is true that such differences are 
not obtained at all ages, they are by no means exceptional. 
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The curve shown in figure 1 is a logistic cun^e typical of all growth 
phenomena. As such it shows at once that intellectual growth does not 
proceed by equal amounts throughout its development. Accordingly, 
any method of calculating I.Q.’s w^hich assumes a linear relationship 
between chronological and mental age cannot possibly give constant 
values for any considerable portion of the growth period. 

The assumption of the linear relationship between M.A. and C.A. 
made by current methods of calculating I.Q.'s leads to certain inevi- 
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table consequences. The first of these is that for the average individual, 
the mean value of the I.Q. vill change from age to age. At early age 
levels or at other periods where the mental growth is rapid, the I.Q. will 
tend to be above the mean of the entire population; at the upper ages 
where mental growth is slower, it will be below the mean. Thus on 
the Stanford-Binet (original standardization), the mean I.Q. for ages 
3 to 5 is approximately 102; for ages 14 and 15 it is more nearly 98. On 
the new or revised Stanford-Binet where the mean I.Q.’s are system- 
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atically above 100 for all ages, those between ages 2^ to 5 average about 
105; whereas those at ages 14 and 15 average about 101 . 5 

A second result that might be expected from a study of the mental 
growth curve is that a child’s LQ* would tend to fall off as he grew older, 
and that this failing off would be more marked in the case of the mentally 
retarded than in the case of the normal or superior child. The explana- 
tion of these phenomena resides in the particular logarithmic relation- 
ship that connects mental with chronological age. In a gross way this 
may be expressed by sa 3 dng that the numerators used in calculating 

I.Q.’s by the method increase more slowly than do the denominar 

tors and that this difference in the rate of increase is most marked in the 
case of mental defectives. Here again observation confirms what may 
be expected theoretically. Thus, results^^ obtained by retesting mental 
defectives show that I.Q.’s of all such individuals decline systematically 
with age.^* 

Much of what we have discussed so far concerns the calculation of 
I.Q.’s for children. The problem becomes more acute when an attempt 
is made to apply the M.A. over C.A. method to the calculation of adult 
I.Q.’s. This brings us to the third result that may be expected from a 
study of the growth curve, namely the ultimate arrest of mental growth. 
The successive increments by which test scores increase with advancing 
chronological age, not only diininish progressively, but ultimately Vanish 
altogether. 

Psychologists have from the first recognized that dividing an adult’s 
M.A. score by his actual C.A. in order to obtain his I.Q. would lead to 
absurd results. To avoid these absurdities, they have generally adopted 
,the plan of using as devisor the highest C.A. beyond which the observed 
M.A. scores cease to increase. This age has been set by different authors 
at from 14 to 18 years. The actual age chosen has depended in part on 
the experiences of the author and in part on the particular scale em- 

It should be added that the discrepancies would be much larger if the scaling 
methods employed did not compensate for them by using different criteria in 
selecting the tests at different age levels. At the lower ages, authors of tests 
usually require 60 to 70 per cent of ‘^passes^’ for locating a test at a particular age 
level, whereas at the upper ages only from 50 to 60 per cent is required. 

1* See particularly, Kuhlmann, F.: Results of repeated mental re-examination 
of 639 feeble-minded over a period of ten years. J. Appl. Psychol^, 1921, V, 
192-224. 

” Incidentally, the fact that the I.Q.’s of individuals of low intelligence tend 
to decrease markedly with age entails some paradoxical consequences. One 
of them is that after being sent to a special school for several years, a mentally 
retarded child might well come out with a lower I.Q. than when he entered. 
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ployed. But apart from the fact that the assumed age has varied from 
test to test, the fixing of it at any particular point has introduced an 
assumption which has served to destroy the meaning of the I.Q. alto- 
gether. This assumption is that M.A. scores remain constant through- 
out adult life or, at least, up to the point where senility begins. If this 
were true, the curve of mental growth from age 16 onwards would be a 
straight line parallel to the C.A. axis. Actually, it is no such thing, 
Jbut a curve which decreases more or less regularly with increasing age 
as shown in figure 2. 


CURVE OF MENTAL GROWTH AND DECLINE 

BELLEVUE FULL SCALE 
AGES 7-68 



Fig. 2 


The curve shown in figure 2 is based upon the data obtained with the 
Bellevue Intelligence Examination for ages 7 to 65. We have already 
discussed, the characteristics of it up to the age of puberty. The part 
which interests us now is that portion of it that describes the observed 
changes in mental ability from about 14 years onwards. We might add 
that the curve which we give here is very similar to the one that we 
derived^^ from the studies of Miles^® and those of Conrad and Jones.^® 

Wechsler, D. ; The Range of Human CapacitieSj p. 85. 

Miles, W. R.; Measurement of certain abilities throughout life span. Pro- 
ceedings National Academy of Science, 1931, XVII, 627-633. 

Miles, W. R., and C. C. : Correlation of intelligence scores, age, etc. Amer, J . 
Psychol,, 1932, XLIV, 44-78. 

Jones, H. E., and Conrad, H. S. : The growth and decline of intelligence, etc. 
Gen, Psychol, Monog,, 1933, Xin, 223-298. 
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What they all show is, that beginning at an age varying from 15 to 22, 
all scores of mental ability, far from remaining constant, start to fall off. 
The point at which the falling off begins and the rate at wMch it 
progresses will vary from test to test and cannot be fixed. It is true 
that for most intelligence scales the difference between ages 15 and 25 
are for most practical purposes negligible, but above that age the decline 
becomes appreciable and increasingly important. After 35, they begin 
to decrease so fast that the use of a single denominator for calculating 


I.Q/s for adults will introduce serious errors. To calculate an I.Q. for a 
man of 60 by dividing his M.A. score by 15, is as incorrect as to obtain 
an I.Q. for a boy of 12 by dividing his M.A. score by 15. Nevertheless 
that is precisely what the general practice has hitherto been. All adult 
I.Q/s have been almost universally calculated by using some assumed 
constant denominator. The effect of using a single denominator for 
calculating I.Q.’s of adults is to destroy the very basic concept of the I.Q. 

The essence of the I.Q. concept is that part of its defimtion which 
asserts that for a valid evaluation of an individuaFs brightness, one 
must compare his mental ability to that of the average individual of his 
own^-Age. The actual method by which this is done is of secondary, 
though not negligible importance. The essential thing is that the I.Q., 
however obtained, takes into consideration the age of the subject. As 
soon as the age factor is discarded, the I.Q. ceases to be an I.Q. in the 
original sense of the term. If this is correct, it follows that all adult 
I.Q/s which are obtained by dividing an individual score (M.A.) by any 


fixed denominator, are not LQ.’s at all. 

From the theoretical point of view, the precise value of the denomi- 
nator used, that is whether the adult M.A. is taken as 14, 15 or 16 years, 
is of no consequence. All are equally fallacious in the sense that they 
assume that mental ability (as measured by tests) remains constant 
after any one of these ages. That unfortunately is precisely what most 
psychologists have assumed ever since the I.Q. was introduced. The 
result has been that in-so-far as adult ratings are concerned, psychol- 
ogists have been really not getting I.Q.’s at allj instead they have been 
calculating various indices of mental efBciency. They have been com- 
paring adults not with individuals of their own age, but with those of 
some favored, usually optimally functioning, age ^up. In brief, by 
assuming a fixed adult mental age, psychologists .have been calculating 
not intelligence quotients but ieffi ciencv quotients..^ 

Our investigations into the nature and meaning of the I.Q. have led 
us to some very disturbing conclusions: First, we found that even if 
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I.Q.’s as now calculated could be said to be constant about the mean, 
this could in no way imply that they were also constant at all other 
points. Next, we found that the M.A. over the C.A. method could not 
possibly give constant LQ.^s for all ages, because the assumed linear 
relation between C.A.’s and M.A.’s does not in fact obtain. Finally, 
we were forced to the uncomfortable conclusion that most adult I.Q.’s 
as hitherto obtained, were not I.Q.’s at all but some sort of eJEciency 
quotients. 

If these conclusions are correct, they suggest that the current method 
of calculating I.Q.’s is in urgent need of revision. To some this might 
further convey the belief that the entire method of evaluating I.Q.^s 
is invalid. The latter, however, would be an unwarranted conclusion. 
It is true that when M.A. scores are plotted against chronological age, 
the relationship observed is not a linear one, but if we examine the curve 
of mental growth we will find that this relationship does hold over cer- 
tain portions of it. Thus although the curve shown on page 27 is far 
from a straight line, the section of it from approximately age 7 to 10 is 
pretty close to being one. This means that the I.Q.’s between these 
ages will remain approximately constant. With a little leniency we 
might say that the straight-line relationship is sufficiently good to give 
valid results from ages 4 to 12 or 13. But below, and particularly 
above, these limits the linear relationship does not hold at all. 

Again if we examine the latter portion of the curve of mental ability 
we note that although it does not remain parallel to the abscissa above 
age 15 as would be required by the assumption of a single adult M.A. 
denominator its decline up to about age 25 is relatively slight. Accord- 
ingly one might allow that I.Q.’s calculated by the M.A. over the C.A. 
method would remain fairly constant over this age span. Here too, 
with a little leniency, we might extend the age limits a few years, say up 
to 35. But above this age the decline of the curve becomes so marked 
that, as we have already noted, the fixed denominator assumption will 
give entirely invalid results. 

We shall now proceed to describe a method of calculating I.Q.’s 
which will make allowance for the form of the curve for mental growth 
and also, as we now know, of mental decline. This method was arrived 
at only after a variety of statistical procedures had been tried out. All 
of them had this in common, that they made use of scores which had 
previously been transmuted into units of equal amount. This was 
achieved in the usual way by expressing our raw scores in terms of 
This portion is not shown in our curve. 
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standard deviation units.^® Before making use of these results, however, 
it was necessary to define zero intelligence. This problem, as might 
have been expected, presented the greatest difl&culty. 

The zero point in a scale of mental measurements, like that of any 
physical scale, may have one of two meanings. It may signify ^just not 
anything’ of whatever it is we are seeking to measure, as in the case of 
the zero of the absolute temperature scale, or merely represent some 
defined point of reference from which we find it convenient to start our 
measurements, as the freezing point of 'water in the centigrade scale. 
In either case, its explicit definition is imperative to enable us to express 
scalar amounts as multiples or fractions. All LQ.’s, of course, are 
precisely such multiples or fractions. Their magnitude obviously 
depends upon the points of reference from which they are being calcu- 
lated. In the case of the M.A. over the C.A. method of calculating the 
I.Q.’s the zero point for both the numerator and denominator is the 
* assumed age of the child at birth. In point of fact this assumption is 
incorrect. A child at birth does not have zero intelligence nor for that 
matter zero chronological age. When a child is born it is already nine 
months “old” and manifests a certain amount of intelligence. What- 
ever the situation, however, it could not be used as a point of reference 
for a scale like ours. Accordingly are forced to look for another 
point of reference to define our zero, that would be related in some 
quantitative way to actual test scores. 

A number of suggestions have been made by various writers as to how 
zero intelligence might be defined in terms of scores. The most cogent 
one is perhaps that of Professor Thurstone^® who defines it “as the 
amount of test performance at which variability vanishes.” Such an 
amount, it might appear, ought not to be difiScult to determine for any 
given scale. All that is seemingly necessary is that we find a point 
below which no test score of any kind is possible. But the situation is 
not so simple. The reason is that what we are seeking is not a point on 
our scales beyond which there are no lower scores but really a point 
corresponding to a degree of intelligence below which intellectual ability 
may, to all intents and purposes, be said to be non-existent. That, of 
course, is quite a different matter. An individual failing to make any 
score on a given test might still maice some sort of score on a much 
easier test. For example, a zero score on the Army Alpha is equal to a 

See appendix. 

Thurstone, L. L.: The absolute zero in the measurement of intelligence. 
Psychol, XXXV, 1928. 
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score of about 12 on the Army Beta;^® and there are other tests still 
easier than the Army Beta on which individuals can obtain some sort 
of score, even when unable to do anything on Beta. We therefore can- 
not take the lowest score attainable on any particular scale as the true 
zero point of intelligence. Some other method of arriving at it is 
necessary. 

The usual way out of the difficulty has been to turn to the probability 
curve for an inspired guess. The technique consists of assuming a zero 
point so far from the mean of the group tested that the slight amount 
of ability this assumed point represented would make it highly im- 
probable that any individual could be so ill endowed as ever to reach it. 
In terms of units of deviation, custom has tended to set this point at 
—5 S.D. from the mean, and our first I.Q. Tables were calculated on 
this basis. 

I.Q. tables calculated by setting a zero point at —5 S.D. from the 
mean, gave us fairly satisfactory intelligent quotients. The method of 
obtaining them, however, seemed altogether arbitrary. We really had 
no rationale for the particular limits which we had chosen. It is true 
that when calculated with the zero limit set at —5 S.D. we obtained 
I.Q.’s that were not very much different from those we could obtain 
by the M.A. over C.A. method, after transmuting the sigma scores into 
the equivalent M.A.’s, nor indeed ver^^ much different from those we 
eventually obtained by the method we finally adopted. But we could 
offer no justification for our procedure, other than that of matter-of-fact 
empiricism. Moreover, when we set the point at — -5 sigma, we dis- 
covered considerable irregularity in the I.Q. limits for our various age 
groups. In particular, we found that I.Q.’s below 100 tended to be 
lower, and LQ.^s above 100 higher than seemed warranted. For all 
these reasons, we decided to abandon the idea of defining zero intelli- 
gence and to seek instead a defined base that was at once more rational 
and less difficult to manipulate. 

The base which we finally chose to define, was that amount of intelli- 
gence which was represented by the individual who was one probable 

Conversely, a score of zero on the Army Beta would be equal to a score of 
about —71 on Army Alpha. The reader who is puzzled by these numbers, should 
recall that mathematically 0 is an indeterminate quantity. In psychology it 
means so small an amount of ability as to be just insufficient to enable its possessor 
to obtain the lowest possible score on a given test. Each test will therefore have a 
different zero point. Realization of this fact will show why scores on different 
tests forming a single battery, cannot he added together unless they have been 
j)reviously equated against one another. 
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error away from the mean. We chose that distance because, through 
almost universal acceptance, the deviation —1 P.E. is used as the 
dividing line between individuals who are referred to as average (normal) 
and subaverage (below normal). According to this view, an average 
individual is a person who falls within the middle 50 per cent of the 
group, a range which on the normal probability curve is defined by the 
value +1 to —1 P.E. from the mean. 

After setting -1 P.E. as the definable point from which our I.Q.'s 
were to be calculated, we next had to decide upon the value of the I.Q. 
which should be assigned to it. We say we had to decide because the 
absolute numerical value of the I.Q., as we trust the reader is now con- 
vinced, is altogether a matter of convenience. An individual’s I.Q-, as 
we have so often insisted, merely defines his relative position among the 
group with which he has been compared. The important fact about it 
is this relative standing and not the numerical rating which we may 
happen to assign to it. 

The numerical value of an I.Q. has no more fixed meaning than a 
passing mark on a scholastic examination. We can, ad lib,, set the 
passing mark at 60, 70 or 90 without altering its implication, if by 
passing we mean the attainment of certain relative excellence or level 
of efficiency. In this sense, the meaning which any mark has is derived 
from its relative position among the set of marks that are being evalu- 
ated. For example, a mark of 90 may mean very superior or barely 
passing, depending upon the total range of the marking scale. It is 
the same with I.Q.’s. In the final analysis the level of intelligence 
which any I.Q. represents, will depend not upon its absolute, but its 
relative magnitude. 

While the numerical rating that can be assigned to an individual at- 
taining any distance from the mean (in our case — IP .E.) is altogether a 
question of convenience, certain practical considerations limit the 
particular values which we may employ. The most important of these 
is the value of the mean I.Q. Here the historical definition has set it 
once and for all at 100. For ail other I.Qds there is no such historical, 
nor statistical cogency. The only limitation imposed upon us is that 
I.Q.^s of individuals below the mean must be less than 100. But in 
choosing a base from which all I.Q.’s were to be calculated, it was 
obviously a matter of common sense to select such value for it as would 
be in line with the order of numerical values of LQ.^s now in general use. 
In the case of most intelligence scales, an I.Q. of 9Q has come to be 
interpreted as the lowest limit of what is generally called average intelik, 
gence. Since the distance —1 P.E. from the mean designated the lower 
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limiting value of the category “average’’ in our own classification, we 
decided to use 90 as the I.Q. against which the distance of —1 P.E. 
might conveniently be equated.^i 

By equating the distance — 1 P.E. against the I.Q. of 90 we at once 
defined, not only this particular I.Q. but all other I.Q.'s as well, because 
the equation by which this is done automatically defines the zero point.^^ 
This zero point is obviously that S.D. distance from the mean which 
gives us an I.Q. of 90 for any individual who attains the position of 
— IP .E. from the mean. Having obtained this zero point, it is then a 
matter of simple arithmetic to draw up one’s I.Q. tables. All that is 
necessary is to determine the mean and standard deviation of one’s 

distribution, prepare a table of “z” scores, and by the formula ^ ^ ^ 

Aw 

obtain for each actual score the corresponding I.Q. This is the method 
we used for establishing our I.Q. tables. 

Having described a new method of calculating I.Q.’s, we shall now 
briefly summarize its advantages over those currently in use: First, it 
dispenses with all assumptions as regards the precise relations between 
intellectual and chronological ratings of growth, and in particular, 
assumptions as regards the rectilinear relation between the M.A. and 
C.A. Secondly, in the calculation of adult I.Q.’s it relieves us of the 
need to co mm it ourselves to any fixed average adult mental age. Each 
age defines its own adult denominator. Thirdly, the method enables 
us to calculate I.Q.’s which maintain the same meaning throughout the 
life of the individual. Fourthly, it retains the original and only im- 
portant meaning of the I.Q. namely, that of an index of relative 
brightness. 

So much for the meaning, implication and method of calculating a true 
I.Q. All of this, however, is only a means to an end, — ^the classification 
of intelligence itself. The purpose of an I.Q. is to enable us to tell how 
bright or how stupid a person is, and how on the basis of test scores to 
classify him scientifically, with respect to what we are pleased to call 
^ native intell e ctua l endowment. This is a large order, and we shall 
neecTaluil chapter to see how it may be done. 

From a statistical point of view, the use of the probable error has several 
advantages. First, the P.E, or any multiple of it is a, very definite measure. 
Second, it is a measure that is influenced relatively little by extreme or accidental 
cases. Third, it possesses certain mathematical properties which are related to 
other fundamental measures of variability as well as to the normal probability 
<^rve as a whole. Lastly, it involves us in no assumption which experience can- 
not either verify or disprove. 

** See Appendix 1. 
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When psychologists speak of classifying intelligence, they use the term 
in a somewhat specialized sense. The purpose of a mental classification 
is not, as in most other scientific classifications, ‘‘the detection of the 
laws of nature,” It does not correspond, for example, to the chemist’s 
arrangement of the elements into a periodic table or even the zoologist’s 
subdivision of animals into vertebrates and non-vertebrates and then 
again into their various orders. The psychologist’s effort at classifying 
intelligence is more like what the layman does when he tries to dis- 
tinguish the colors of the rainbow. The analogy is more than a super- 
ficial one. General intelligence is a non-interrupted continum like that 
of a rainbow spectrum. One level of intelligence merges into the next 
like colors seen through a refracting prism. Levels of behavior which 
present cejtain patterns (or hues, to retmm to the color analogy) are 
called defectives, others a little farther up the scale are called borderlinfe, 
still others dull-noitoal and so on until we reach the other , end of the 
scale where they are labeled very superior, pre^eious or ^ffius. The 
borderline runs into the dull-normal and the high average into the 
superior just as the orange-yellow runs into the yellow, and the deep 
violet into the indigo. In both cases it is convention or custom which 
has assigned them their respective names. 

The earliest classifications of intelligence were very rough ones. To a 
large extent they were practical attempts to define various p^tems of 
behavior in medical-legal terms. These terms, like idiot, imbecile and 
moron, coined by the early writers on the subject, stiU form part of our 
present day terminology. The contribution of modem psychology has 
been not so much in the matter of defining new configurations of intelli- 
gent behavior as in giving precision to the already available concepts, 
through the introduction of quantitative methods. A mental defective 
is now defined not merely as a person who ‘‘through congemtal^arrest 
or imperfect mental development is incapable ... of managing himself 
or his affairs with ordinary prudence,” but as one who on standardized 
tests fails to attain an I.Q. or an M.A, of a particular level. In brief, 
psychologists have attempted to classify intelligence by means of quan- 
titative measurements. This has been a great step forward. The 

36 
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progress to be realized is like that achieved by physicists in designating 
colors by their wave-lengths instead of their hues. 

While the theoretical advantages of classifying intelligence quanti- 
tatively are obvious, the practical gain of such classifications is not 
always as apparent. The reason for this is that the merit of any quan- 
titative classification necessarily depends on the validity of the data 
employed in arriving at it. These data, in the field of mental classifica- 
tion, usually consist of measures of brightness derived from one or 
another intelligence test. The measure of brightness most commonly 
used is the intelligence quotient. It is for this reason that we devoted 
such considerable space in our last chapter to the discussion of the LQ- 
But the calculation of^ a correct and constant I.Q. is only the first 
prerequisite for its use as a basis of classification. We still have to 
decide as to the meanings we can attach to our indices, however obtained. 

TABLE 2 


Classification of intelligence according to Terman 


LQ. BANQB 

CLASSIFICATION 

Below 70 

Definite feeble-mindedness 

70-80 

Borderline deficiency 

80-90 

Dullness 

90-110 

Normal or average intelligence 

110-120 

Superior intelligence 

120-140 

Very superior intelligence 

140 and above 

Genius or near genius 


In practice the procedure has consisted of pairing certain familiar 
qualitative terms against I.Q. ratings falling within certain limits. 
Thus in Terman’s classification, individuals attaining I.Q.*s below 70 
are designated as mentally defective, those between 80 and 90 as dull- 
normal, those between 90 and 110 as average, and so on. According to 
Kuhlmann, the I.Q. limit for the corresponding categories are: mentally 
defective, below 75; borderline, 75-84; dull, 85-94; average, 95—104. 
And other writers have used still other delimitations. 

When one examines the various I.Q. classifications that have been 
offered, a number of questions present themselves. The first is vrhy the 
limiting I.Q. values are the ones given and not some others. As an 
example we may consider Professor Terman’s original classification 
which is reproduced in table 2. The first thing that strikes us is that the 
limiting second cipher of each new class is a zero. Thus the category 
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borderline begins at 70 I.Q-, the category dull-normal at 80 I.Q., the 
category average at 90 I.Q., and so on. It is difficult to conceive of any 
statistical procedure which would give such neat correspondence. It is 
possible that the limiting numbers actually obtained were sufficiently 
near the even decades so that, allowance being made for the variability 
of the measuring instrument, it may have been permissible to round off 
the numbers. But the actual chance of each of them beginning with 
the zero is only 10“^ that is, one in 10,000,000. 

A more serious objection to Professor Terman’s I.Q. classification 
than the improbability of all class intervals beginning and ending 
where they do, is that we are furnished no rationale for the choice of the 
indicated class intervals. We are not told why an I.Q. between 70 and 
79 rather than between 72 and 84 was chosen to denote borderline 
intelligence, or an I.Q. between 110 and 119 rather than between 113 
and 126 to signify superior intelligence. There is an implication that 
in some way the intervals used are based on the normal curve of distribu- 
tion. But there is no explicit statement that such a scheme was in- 
tended. In any case, we have no way of determining from the class 
limits themselves what that scheme may have been. 

Our comments on Professor Terman’s I.Q. classification hold equally 
for all others published. We have chosen it to illustrate the short- 
comings of current classificatory schemes not because it is the most 
vulnerable, but because it is the most familiar of any now in use. For 
all practical purposes it may be said to have become the standard 
scheme of mental classification in this country. The Terman classifica- 
tion is being used not only for I.Q. ratings obtained on the Stanford- 
Binet, but for I.Q.’s derived from a host of other tests for which they 
cannot possibly hold. For the latter misapplication Professor Terman 
is, of course, in no way responsible. Indeed he has on different occa- 
sions cautioned psychologists against this practice. Little attention, 
however, seems to have been paid to this admonition. We have seen 
equivalent Binet I.Q. ratings reported for nearly every intelligence test 
now in use.^ In most cases the reporters proceeded to interpret the 
I.Q.’s obtained as if the tests measured the same thing as the Binet, and 
the indices calculated were equivalent to those obtained on the Stanford- 
Binet. Thus an I.Q. of 75 on an Army Alpha, a Pintner-Paterson or 
even a Porteus-Mazes Test might alike be interpreted as signifying 
borderline intelligence; an I.Q. of 85 on any of these, as dull -normal 

i Including such as were derived from single test scores, like the Portfeus 
Mazes, the Healy P.C. II, and the Trabue Sentence-Completion Tests. 
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intelligence, and so on. The examiners were seemingly unaware of the 
fact that identical I.Q.’s on the different tests might well represent very 
different orders of intelligence. 

But let us return to the question of what I.Q, limits ought to be 
assigned to the various qualitative categories of intelligence. Clearly 
the basis of any scheme will, in the last analysis, be largely a matter of 
convention. This follows from the fact that the absolute value of an 
I.Q. is a purely arbitrary number. Its magnitude, as ^ve have seen in 
the last chapter, can be manipulated to suit the convenience of the 
investigator. Accordingly, every author of a test is in a position to 
devise his own I.Q. scale. A free exercise of this privilege would, how- 
ever, lead to confusion, because one would not be in a position to inter- 
pret the I.Q.'s of any given scale without detailed knowledge of the 
author’s standardizing technique. Some agreement, both as to system 
of notation and interpretation of results, is obviously necessary. Such 
agreement could best be achieved through conventions established by 
an international meeting of psychologists and psychiatrists. Un- 
fortunately authors of tests are far from ready for such agreement. One 
is therefore left in a position of either having to adopt schemes of 
classification already in vogue, or risking further complication by the 
addition of others. We have tried to compromise with the situation by 
devising a scheme of I.Q. classification, which though new, does not 
differ^ too much from those in current use. This scheme of classification 
is given in tables 3 and 4. 

The basis for our classificatory scheme is the definition of intelligence 
levels in terms of statistical frequencies. Each intelligence level is 
defined as a class interval embracing a range of I.Q.’s falling at such and 
such distance from the mean, where these distances are expressed as 
multiples of the probable error. Thus a mental defective is a person 
who falls at a distance of 3 or more P.E. below the mean. In terms of 
percentile ranking he falls approximately among the lower 2.2 per cent 
of the total population. Similarly a person of borderline defective 
intelligence is an individual who attains an I.Q. that falls between a 
deviation of —3 P.E. and —2 P.E. from the mean or in terms of percen- 
tile rank a position anywhere from circa the lowest 3rd to circa the low- 
est 10th percentile. And so with the other categories. Our choice of 
the limiting points was only in part arbitrary. In the case of mental 
defectives we had available vaiious estimates of the probable incidence 

*When we say ‘^does not differ too much” we refer only to the numerical 
magnitude of the I.Q.'s and not to the method by which the I.Q.’s were derived. 
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of mental deficiency in this country. These estimates, though vaiying 
greatly among themselves, gave a mean figure which was not ar 
about 3 per cent of the total population. It therefore seemed reasonable 
to us to define the mental defective group as those individuals who at- 
tain I.Q.’s falling at a distance of -3 or more P.E.’s from the mean. 
This distance is equivalent to about 2.2 per cent of the total area of the 
normal curve. For the intermediate categories such as borderline and 
dull-normal intelligence, as weU as those of average, high-average, 


TABLE 3 

Siaiistical basis of intelligence classifications {theoretical) 


CliASaiFlCATION 

umrs IN TEEMS OP P. E. 

PEE CENT 
tNCEUDED 

I 


—3 P.E. and below 

2.15 


-2 P.E. to -3 P.E. 

6.72 

Viiill T'Jnrmnl 

-1 P.E. to -2 P.E. 

16.13 

A vprape .. . 

-1 P.E. to +l P.E. 

50.00 



-hi P.E. to -h2 P.E. 

16.13 

Sup6rior. 

-h2 P.E. to -h3 P.E. 

6.72 

Very Superior 

-h3 P.E. and over 

2.15 


TABLE 4 


Intelligence classification according to I.Q. — ages 10-60 (actual) 


CULSaOTCATtON 

I.Q. LIMITfl 

PER CENT 
INCIiTJDEI> 

Defer-tiv^^ 

65 and below 

2.2 


66-79 

6.7 

Dull Normal 

80-90 

16.1 

Average 

91-110 

50.0 

Drigbt Normal 

111-119 

16.1 

Superior , 

120-127 

6.7 

Very Superior . . 

128 and over 

2-2 





superior and very superior, we decided to use tlie intervening integral 
multiples of the probable error. 

Our scheme of classification is sjunmetrical, comprising as many 
classes above the mean as there are below it. In the case of categories 
below the mean, it was easy to take over the terms now in general use- 
In the case of classes above the mean we lacked one verbal category to 
give us a symmetrical classification. We were confronted with the 
problem of deciding what to call the group plus 1 P.E. to plus 2 P.E. 



THE CLASSIFICATION OP INTELLIGENCE 


41 


above the mean. Since the individuals comprised in this category 
form a group of subjects who are as much above average as the dull- 
normal are below the average, a logical term that suggested itself was 
that of Bright Normal. The term is rather clumsy but better than most 
that come to mind. As a second choice there is the somewhat long but 
descriptive term High-Average-to-Superior, and we offer this as an 
alternative. It should be noted that the phrase High-Average-to- 
Superior does not have the same denotation it has in the Terman classi- 
fication. 

The final classification at which we have arrived, together with the 
percentage included in each category, is given in table 4. These per- 
centages though justified by a rational statistic are, of course, in no wise 
definitive. If, for some reason, future experience would show that the 
present limits are not the best, they can be altered in the light of that 
experience. Indeed, if any one already disagrees with the limits as 
here given, he may substitute others of his own hking and use our own 
data with which to do it (table 5, p. 42). Our classificatory scheme, 
however, ought to serve as a model for others. More particularly we 
trust that the example set here, of presenting basic test score distribu- 
tions with their appropriate constants, will be followed by other inves- 
tigators. With such data available it would be a matter of simple 
arithmetic to equate any one scale against any other. This would 
render comparison of results obtained with different tests a matter of 
scientific evaluation instead of, as at present, a matter of arbitrary 
guessing. 

The classification offered above, like all others making use of I.Q.^s, 
is based essentially on a statistical concept of intelligence. It differs 
■ from other classifications of this kind in that our statistics have been 
more rigorously developed, and above all by the fact that our assump- 
tions have been more explicitly stated. But the important thing about 
all such classifications (whether one admits it or not) is that they aban- 
don all attempts at aij ^^qjuf^^defiintiqj^ o^ intelligence. jLa I.Q. 
merely teUs you how^/mui^^etter worse, dr fixaeh'^ above or 
below t^^erage^toy’ttd^ when compared with persons of 

his own^-age. What that average represents we really do not know. 
In a point scale it is some numerical score; in a mental age scale, an 
M.A. equivalent. Most people can readily see that a point score has 
no absolute significance, because among other things its numerical value 
is so obviously dependent upon the nxnnber of items that happen to 
comprise the scale. In the case of the mental age scores, even psychol- 
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ogists are often under the impression that we are dealing with some 
absolute quantity, and the impression is even more common among 
psychiatrists. There is a rather widespread view that in defining in- 
telligence in terms of mental age, we are doing so in terms of some 
basic unit of amount. That, as we have seen, is a mistake. A mental 

TABLE 5 


Perceniile ranks for T.QJs {Bellevue intelligence scales) ages 10-60 


PERCENTILE RANK 

LQ. 

1 

59 

3 

68 

5 

73 

7 

77 

10 

81 

15 

85 

20 

89 

25 

91 

30 i 

94 

35 

96 

40 

98 

45 

99.7 

50 

101,4 

55 

103 

60 

105 

65 

106 

70 

108 

75 

110 

80 

112 

85 

115 

00 

118 

95 

123 

97 

125 

99 

130 


age is just a test score and differs from other arithmetical summaries, 
only by the fact that it happens to be in a year-paonth notation. The 
mental age notation has a number of advantages, but among these is 
not the magical one of being able to transmute a relative into an abso- 
lute quantity. In brief, mental age is no more an absolute measure of 
intelligence than any other test score. 
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We have at this point returned again to the question as to what a 
mental age really is, because a number of authors, and no less an au- 
thority than Doll,^ have suggested and indeed urged that intelligence 
be classified on the basis of mental age ratings rather than on I.Q.’s. 
Their arguments in favor of this change may be said to be of two 
kinds. The main argument against the use qt:the-T.Q. for the classifi- 
cation of intelligence is the fact that the^I.Q. does not ^main constant. 
This criticism, as we have seen, is justified."^‘'‘Bnt, as we have further 
shown, this is not the fault of the I.Q. but of the particular method by 
which it has been calculated.'^ The second important argument in 
favor of the M.A. over the I.Q. as a basis for classifying intelligence 
is that the M.A. does so in terms of fixed levels and hence definitely 
known amounts of intelligence. There is a further implication, though 
not stated in so many words, that an M.A. level can be looked upon as 
a sort of absolute measure. We have already shown that this cannot 
possibly be. But, in any case, it would be a mistake to set up the M.A. 
as a competitive base for the classification of intelligence. To do so 
would be tantamount to abandoning almost entirely the statistical 
concept of intelligence for which psychologists have so long worked. 

One cannot emphasize too often the importance of the statistical 
concept of intelligence for the science of mental classification. It was 
first introduced in psychology by Galton when he defined genius. A 
genius, according to Galton, was a man “who [because of his eminent 
work] achieved the position of one in each nnllion.’^ Of course the 
genius’ rarity or uniqueness is not the only characteristic which dis- 
tinguishes him from the average man.® Genius is also determined by 
jvhat a, man does as well as the expertness with which he does it. The 
thing done' must be in some way e^emed by those capable of judging 
its merits. From this point of view, men of genius, according to Galton, 
are those “whom the whole intelligent part of the nation mourn when 
they die, who deserve a public funeral, and whom future ages rank as 
historical characters.” But with his own intuitive genius Galton realized 
that it is not possible to define various degrees of ability, however great 
and however measured, in terms other than those of relative position. 
A genius was one who with regard to any estimated ability attained a 

»Doll, Edgar A.: Improper use of the I.Q. Jowr. of Delinquency y 1920, V, 
pp. 67-70. 

* This limitation is obviated by the method of calculating the I.Q. given in this 
book. 

® For other concepts of genius see Wechsler, B. : Range of Human Capacities. 
Chapter VIII. 
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position of one in a million, just as “an eminent man was one who 
reached the position attained by one person in 4,000.’" Not being 
concerned in his studies of “Hereditary Genius” with the average type 
of individual, he did not have occasion to define the dull, average, or 
even superior. But if he had, it is clear that he w'ould have defined 
them in a very similar w^ay. We have, in a sense, continued Galton’s 
task of defining these remaining groups as regards intelligence. Like 
geniuses, the average, dull and defective individuals are persons who, on 
a particular intelligence scale, reach a position attained by one person 
in such and such total number. Our statistical notation is somewhat 
different from Gallon’s but it can be readily translated into his. Thus 
our average individual is one who attains a position of plus 1 to minus 1 
P.E. from the mean, which is the same thing as the position attained 
by one in every two persons. A superior person is one who attains a 
position of plus 2 to plus 3 P.E. above the mean, which is equivalent 
of saying that he is one person in 15, and so on with our other cate- 
gories. Our scales do not pretend to measure genius. The highest 
rating we have is that of Very Superior Intelligence, that is, a person 
who attains a position of 3 or more P.E.’s from the mean. This is a 
position attained by one person in every 50. It is possible for indi- 
viduals to obtain scores on our scales which would give them higher 
ranking, but we are rather hesitant about calling a person a genius on 
the basis of a single intelligence test score. 

The statistical concept of intelligence and its logical implications are 
extremely difiScult for some people to accept because at times it seem- 
ingly leads to impractical if not absurd consequences. Such conclusions 
do not devolve from the concept itself but may result from an incom- 
plete understanding of it. This is perhaps best illustrated by the reac- 
tions of certain psychologists to the question of the need of special 
norms for special groups. Clearly the statistical definition of intelli- 
gence implies that norms obtained on any particular sample are valid 
only for such groups as the sampled population represents. It does not 
limit the size of the subsequent groups to which the norms be applied; 
these may be as large as the representativeness of the tested sample 
provides for; but it does put a restriction on the type of individual who 
may be included for classificatory purposes. Thus test norms obtained 
on Englishmen cannot be used for classifying Fiji Islanders,® This is 

® It might be argued that this limitation holds only for Fiji Islanders in the 
Fijis and does not apply to a Fiji Islander in London. Here he has to match his 
wits with the average Englishman and could therefore be legitimately tested by 



THE CLASSIFICATION OF INTELLIGENCE 


45 


obvious to everybody. The principle involved, however, becomes 
less obvious when applied to less divergent groups; for example, the use 
of identical test norms for negroes with tests originally standardized 
on white populations; it becomes still less so when the differentiae which 
might distinguish the groups, such as “nationality’^ “economic condi- 
tion” and “social status”, are themselves hypothetical. Nevertheless, 
the limitations still hold. If, for example, social status were a factor 
that influenced these scores, norms obtained on any particular social 
group could not be used on any other which differed significantly from 
it with respect to this factor. If one did use them, the terms average, 
defective and superior would lose their statistical meaning. 

Failure to xmderstand this fundamental implication of the statistical 
concept of intelligence inevitably leads to confusion. You cannot on 
the one hand agree to define intelligence in terms of relative position 
and then disregard the rules by which such a classification is governed. 
When you do, incongruous and absurd consequences are inevitable. 
Thus L. S. Hollingworth rejecting the idea of separate norms for 
separate social classes writes: “If carried out to its logical conclusion 
this would mean that by measuring the inmates of schools for the feeble- 
minded, we might obtain a norm which would be applicable to such 
inmates, on the basis of which they could be classified as ‘normal.’ 
This statement seems like a conclusive argument, but actually only 
shows how imperfectly its author has grasped the full statistical con- 
notation of intelligence. She is quite correct in inferring that on the 
basis of norms obtained from a feebleminded institution, the average 
defective would rate as a normal intelligence. Indeed it is even conceiv- 
able that when classified on the basis of norms obtained on Fiji Islanders, 
many of the individuals of such an institution^ might have to be classi- 
fied as of superior intelligence. But what Dr. Hollingworth failed to 
see is that norms obtained on inmates of a feebleminded institution 
would be valid only for that institution, and that alone. The answer to 

the same tests which we used on any other Londoner. The rejoiner is valid if by 
intelligence we mean intelligence as the Englishman conceives it. With this 
definition the Fiji Islander might well disagree. As scientists, we should at least 
allow him the opportunity of offering his own. It is possible that an Englishman 
tested with a Fiji Islander’s test might not do very well either. The problem rs^ 
obviously more complicated than this simplified statement of the case indicates. 
But we cannot dilate upon it without entering into a discussion which would carry 
us far beyond the scope of this book. 

’ Requoted from Wells, F, L.: Menial Tests in Clinical Practice, p. 60. 

» Particularly if it were one “specializing” in high-grade defectives. 
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Dr. Hollingworth’s quandary is that the subjects of her illustrative in- 
stitution are not a separate but a selected population. They are merely 
the tail end of our larger population of which, for classificatory purposes, 
they form a part. They are not mental defectives because they happen 
to be in an institution for the feebleminded; they are in an institution 
for the feebleminded because they are mental defectives. 

The great advantage of using the I.Q. as a basis for mental classifica- 
tion is that it does not permit us to lose sight of the fact that all meas- 
ures of intelligence are necessarily relative. Nevertheless, for certain 
practical purposes, it is sometimes necessary to use test results as if 
they did represent absolute quantities. This is the situation when we 
use aptitude tests as measures of mental efficiency. In testing apti- 
tudes, we may set up a minimal passing mark and then use this mini- 
mum as a standard for calculating indices of efficiency. The same sort 
of application may be made of intelligence tests. We may say, for 
example, that in order to be a good teacher or a good mechanic, a 
subject must have a minimal intelligence test score of such and such an 
amount. If now the LQ. is used as a measure of the subject's intelli- 
gence, it is clear that the denominator used in calculating it assumes the 
rdle of the minimum score in the case of the aptitude test; and, if this 
denominator is constant, it will partake of all the properties of an 
absolute measure. Such application of the I.Q. is permissible, but when 
used in this way it is important to recognize that the I.Q. has been 
transformed into an E.Q. Efficiency Quotient). 

We have already referred to the difference between intelligence quo- 
tients and efficiency quotients. An intelligence quotient measures a 
person’s relative to those of individuals of his own age group. 

For this comparison the entire group is assumed to be statistically 
homogeneous. In the case of an efficiency quotient we are not inter- 
ested either in the person’s age or any other factors which influence the 
LQ. but only as to how his abilities compare with those of a fixed 
standard. Our point of view would be very much like that we would 
maintain in buying a machine. AJl that one is usually interested in 
when bu3dng a machine is that it be capable of a certain amount of 
work in a given time and at a specified cost. The kind of material 
used or the time it took to manufacture it would, in most instances, be 
of little consequence provided the machine met our specifications. 
We can, if we wish, treat intelligence ratings in much the same fashion, 
but then it is only fair that we distinguish intellectual ability as a meas- 
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ure of intelligence from intellectual ability as a measure of mental 
efficiency.^ 

Although the I.Q. is the best single measure of intelligence, it is 
neither the only nor a complete measure of it. Intelligence, like^ 
personality, is too complicated an entity to be defined by a single 
number. It is a function of other factors beside sheer intellectual abil- 
ity. We know that this must be so, because indhdduals having the 
same I.Q.’s may differ considerably as regards either their actual or 
potential capacity for intelligent behavior. These other factors, — 
drive, emotional balance, persistence, — ^are not always measurable or 
even easily discernible but have to be taken into account in concrete 
situations. In the practical classification of subjects, one often has to 
go beyond the point of merely obtaining an accurate I.Q. Sometimes 
it is necessary to take into account not only the subject’s obvious and 
measurable responses during the examination, but also the record of 
his behavior prior to his coming for the examination. 

Our last remarks suggest that in the definitive classification of a 
person’s intelligence some regard must be paid to the subject’s past 
history, that is, his social, emotional, and, in the case of adults, his vo- 
cational and economic adjustments.^® The land of life one lives is 
itself a pretty good test of a person’s intelligence. When a life history 
(assuming it to be accurate) is in disagreement with the ^^psychometric”, 
it is well to pause before attempting a classification on the basis of tests 
alone. Generally it wiU be found that the former is a more reliable 
criterion of the individual’s intelligence. Inexperienced examiners are 
likely to over-look this fact, just as psychiatrists tend to over-empha- 
size it. Similar disregard of this fact is often met with in individuals 
who engage in what we may call apersonal psychometrics, — teachers 
who give group tests, school psychologists who are restricted to getting 
I.Q.’s, and college professors who merely write about them. Unfor- 
tunately, the medical profession, imtil recently, has been inclined to 
restrict the psychologist to just such apersonal psychometrics. Even 
more enlightened psychiatrists look upon an intelligence examination 

* To meet the need of those who wish to use intelligence tests as measures of 
mental efficiency, we have calculated what we term Efficiency Quotients for the 
Full Bellevue Intelligence Scale. These are given and explained in appendix 2. 

Doirs Social Maturity Scale is an attempt to do this in a systematic way. 
See Doll, Edgar A.: A genetic scale of social maturity. Amer. Jr, Orthopsy- 
chiatry, 1935, V, 180-188. 
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as they do upon a Wassermann test; when an LQ. does not come up to 
expectations, they feel privileged to disregard it. Apart from the fact 
that the analogy between a ps^^chometric and a Wassermann test is at 
best a superficial one, our experience has shown that the average psy- 
chiatrist is no more expert in the field of intelligence diagnosis than the 
average teacher. Moreover, the r6le of a competent psychologist is 
not that of a laboratory technician. His techniques are not an end in 
themselves, but a means of diagnosis. More important than his abil- 
ity to obtain an accurate I.Q. is his ability to inte^ret it correctly 

Non-msasurable factors enter into the classification of all levels of 
intelligence, but the evaluation of them is particularly important in 
defining the feebleminded group. To call a person a mental defective 
is a serious diagnosis. At its mildest, the result is to stigmatize the 
person so labeled; at its worst, it may determine whether he will be 
institutionalized for the greater part of his life instead of being per- 
mitted to work out Ms salvation in the community. In the case of 
the child, mental deficiency involves not only the general question of 
educability, but the specific problem of training and treatment. In the 
case of an adult it may also involve the question of legal responsibility. 
Mental deficiency is thus a medical and legal, as well as a psycholo^cal 
and social concept. This fact complicates the problem of classification. 
Each science necessarily has its own points of view, and this gives rise 
to the question whether any single system of classifying mental defec- 
tives can include them all. Our next chapter will take up this question. 

“ The interpretation of psychometric results, in our opinion, is or should be 
the job of the psychologist who administers them. Even the familiar I.Q., as we 
have seen, is a rather complicated quantity. It is not reasonable to throw the 
onus of interpretation upon the average doctor, teacher, judge and social worker 
to whom reports containing I.Q.’s eventually go. Unfortunately the common 
practice of sending numerical data to schools, social agencies and courts has in 
many instances served to deprive the psychologist of that function. The net 
result has been that the doctor, teacher, judge and social worker frequently take it 
upon themselves to do the interpretation. This is in part due to the fact that 
persons in administrative positions in time acquire a belief in their own expertness 
on a great many different subjects. In part, however, it is due to the fact that 
psychologists themselves have too often been remiss in the way that they present 
their results. A psychological report which contains two or three different I.Q.'s 
accompanied by a mass of technical analysis, is hardly what the lay person wants 
or can digest. In practical situations nobody is particularly interested in the 
particular score which a subject obtains on a particular test. What he wants to 
know is what that I.Q. means in terms of general or specific ability for adjustment. 
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The concept of IMental Deficiency has, like many other concepts in 
psychology, imdergone considerable modification since the turn of the 
century. This change has involved two separate, though not unrelated, 
alterations in point of view. The first pertains to the abstract defini- 
tion of the teim, the second to its practical application. Psychologists 
have not only sought to give a more definite answer to the question, 
‘^Who are the mental defectives?”, but also sought to furnish quanti- 
tative methods for differentiating them. The shift in point of view 
has been from the older moral, medico-legal essays at abstract defini- 
tion of mental deficiency to the current quantitative, statistical, prac- 
tical methods of measuiing it. Psychologists do not now speak of social 
defectives of rnor^ imbeciles, but of mental ages and I.Q.’s. Accord- 
ingly a mental defective is no longer defined merely as one who be- 
cause of arrested mental development, etc., is unable to handle himself 
or his affairs with ordinary prudence, but as an individual whose lack of 
intellectual endowment is such as to render him incapable of attaining 
a minimum score or rating on certain standardized tests, for example, 
an M.A. of 10 years, or an I.Q. of 70 on the Binet Intelligence Ex- 
amination. 

The definition of mental deficiency in terms of attained mental age or 
I.Q. levels on intelligence tests, represented a maiked step forward but 
did not completely solve the problem of classification. The task still 
remained of arriving at (and, of course, justifying) the I.Q. or M.A. lim- 
its assigned to various degrees of mental deficiency as well as the cate- 
gory as a whole. And here we run into much the same difficulties that 
w’e met in the general problem of mental classification. The defiboition 
of mental deficiency is only a special or limiting case in the definition of 
levels of intelligence. Like the designations average, dull, superior, 
etc., it can only be delimited unambiguously in terms of statistical 
criteria. Accordingly, the most that we can say unequivocally, is that 
a mental defective is an individual who on a w ell standardized in 
“gence tesCati^Hs a rating which places him among tte lowest P^o^two, 
four or x-percent of the population. Of course this is not all that we 
can say about a mental defective. We can describe what he can or 
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cannot do, the chances of his reaching a certain grade in school, his 
likelihood of learning a given trade, and so on. All these are signif- 
icant and useful facts to know, but in themselves cannot be used for 
psychometric classification. 

The usual method of defining mental deficiency psychometrically is 
in terms of I.Q. or M.A. We have already discussed what these signify. 
It is important to reemphasize that their value as a basis of mental 
classification lies wholely on their statistical correlates. We may say 
that individuals who attain I.Q.’s below 70 are mental defectives so long 
as we intend, and the limit actually cuts off the lowest 2 or 3 or any 
other agreed upon percent of the general population. But we cannot 
use this criterion if in point of fact it does not agree either as to intent 
or fact. For example, the New York State mental hy^ene law provides 
that individuals attaining I.Q.’s below 75 on the 1916 Revision of the 
Stanford-Binet can be classified as mental defectives. As it also states 
that I.Q.’s of adults are to be calculated on a 16 year basis, actual appli- 
cation of the criterion would automatically classify some 20 per cent 
of the white and a considerably larger percent of the adult colored popu- 
lation of the United States as mental defectives. It is extremely doubt- 
ful if those who formulated the provision knew, much less intended, the 
provision to be as drastic as it would turn out to be if actually applied. 

The statistical definition of mental deficiency for which we have argued 
differs from most current psychometric definitions in several respects. 
The most important of these differences is that it designates no precise 
mental age or I.Q. which specifically classifies an individual as a mental 
defective. It argues for the use of standardized tests, but at the same 
time implies that an identical M.A. or I.Q. may in the case of one indi- 
vidual denote deficiency and in the case of another fail to indicate it. 
There are three reasons for this. The firat derives from our biological 
definition of intelligence which insists that intelligence involves other 
items than purely intellective factors; the second follows from our purely 
statistical definition of degrees of relative ability; and the third from our 
refusal to accept the current notion that mental deficiency is a unitary 
entity. The first tWo points have already been elaborated in previous 
chapters and we shall now only enlarge upon the last contention. 

Mental deficiency, unlike typhoid fever or general paresis, is not a 
circumscribed entity.^ A mental defective is not a person who suffers 

1 The remarks that follow are in part a reproduction of the author^s paper on 
'The Concept of Mental Deficiency in Theory and Practice^', Psychiatric Bulletin 
1935, rX, pp. 232-236. 
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from a specific disease, but one whose general intellectual inadequacy 
allocates him to a social group whose level of functioning is such as to 
make it impossible for him to cope with his environment. Mental 
defectives are primarily individuals who because of lack of mental ability 
need special care, education and institutionalization. They are indi- 
viduals who for reasons other than special physical or mental disabil- 
ities (neuroses, psychoses, etc.) are unable to care for themselves. 

A mental defective is characterized not only by a lack of ability to 
care for himself but also by an incapacity to use effectively whatever 
abilities he does have. His actions are often not only senseless and 
inadequate but perverse and anti-social as well. He may be not only 
stupid but vicious, and the question arises why he is sometimes one and 
not the other? Is it due merely to the fact that he is unable to compre- 
hend the significance of his acts or is his perverse behavior due to other 
deficiencies? What we are, of course, asking by these questions is 
whether mental deficiency, is, as is commonly held, exclusively a matter 
of lack of intellectiial ability or whether it also includes inadequacies in 
other aggccts of the individuaPs endowment. 

"TTKe answer is that except in the case of individuals of very low intelli- 
gence (imbeciles and idiots) it is generally necessary to take into account 
factors other than sheer lack of intellectual ability in making a diagnosis 
of mental deficiency. This is particularly true of individuals who fall 
psychometrically into the moron, highgrade moron and borderline defec- 
tive groups. Here an I.Q. alone, however accurately determined, is 
often insufficient for a definitive classification. This is shown by the 
fact that there are many individuals with I.Q.^s above 70 wffiose behaviour 
is definitely defective. On the other hand there are individuals with 
I.Q.'s below 60 whose entire life history is that of a non-defective. 

The following case illustrates that of an individual with a relatively 
high I.Q. who must nevertheless be considered a mental defective: 

J. M., 26 years old, native white, was arrested on a charge of impairing the 
morals of a minor (a girl of 10 years). He is reported to have made si mil a r 
attempts on several previous occasions and had, for some years, been a persistent 
problem on this account. The family states: ''We have always kept an eye 
on him because we felt he would get into trouble.” 

Physical examination: Unattractive-looking youth who appears to be 
younger than he is. General physical and neurological examination as well as 
blood Wassermaim is native. 

Psychiatric e xamina tion: Appears dull and indifferent and childish. General 
reactions, immature. Is careless about person, but able to take care of himself. 
Diagnosis: mental (Mective-moroii. 
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Psychological examination: Stanford-Binet, 13 years 8 months, I.Q. (15 
years) 91. Scatter IX-XVI, inclusive. Except for designs (patient had bad 
vision), no failures below XII year level. ^ 

Work history: Patient was never able to find work for himself but a number 
of jobs were procured for him by his father. These he was unable to hold for 
any length of time. 

The case just cited is typical of many that come to the attention of 
any large clinic. Frequently the subjects are sex delinquents and, more 
often than not, individuals who have gotten into trouble with the law 
or otherwise proved themselves incapable of meeting the ordinary 
exigencies of ^social adjustment. On psychometric examinations they 
may rate dull-normal, or even_average. Judged by ^cial criteria they 
are mentally defective. 

That such a group exists has been long recognized by those dealing 
first hand with delinquent and socially inadequate individuals. They 
are patients whom Tredgold^ has called '^individuals lacking both in 
prudence and moral sense’' though yet suffering from no scholastic or 
educational disability. As Tredgold well points out, an individual 
falling into this group 'Tar from being illiterate, may have quite a good 
range of educational requirements. He may be nimble-witted, a good 
conversationalist, plausible in argument and be able to give a good 
account of himself.” In spite of this he is anti-social and requires super- 
vision for his own welfare as well as for the protection of others. What 
he lacks essentially is an ordinary adaptiveness or "wisdom in the 
moral sense”, and it is important to realize, as Tredgold insists, that 
such a person "is mentally defective in the usual and necessary sense of 
the term.” 

In contrast to the above, there is another group of individuals who 
systematically rate as mental defectives on mental tests, but, who can 
in no way be classified in this group when diagnosed on the basis of 
concrete social standards, i.e., in terms of capacity to adjust to the 
normal demands of their social and economic environment. They are 
frequently illiterate and, commonly, individuals coming from definite 
social strata where opportunity for education is small, and stultifying 
labor the general rule; many belong to what might be termed the 
"peasant type”, as indeed they frequently are by origin. However, 
the first case of this type that came to our attention was a native, white 
Oklahoman of 28, who had come up for individual psychological exam- 

» Tredgold, A. F. : MerUca Deficiency, 5th Ed., Baltimore, 19^. 
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inations because he had failed to pass the Army Alpha and Army Beta 
intelligence tests. On both Stanford-Binet and the Yerkes Point Scale 
he obtained a mental age of less than 8 years. Nevertheless, before 
entering the Army he had gotten along very well, was supporting 
a family, had been working as a skilled oil-driller for several years and, 
at the time of draft, was earning from $60 to $75 per week. Incidentally, 
he was making the grade as a soldier, and would not have come to the 
attention of the authorities had he not failed on the psychological tests. 

It would be possible to' cite many cases, ^ particularly from among 
adults of foreign birth, who rate systematically defective even on non- 
verbal tests, but whose social life history of at least an adequate adjust- 
ment contradicts this classification. Nor can we disregard at this point 
the question of negro intelligence test results, which if taken at face 
value, would necessitate our classifying over 40 per cent of that group 
as mental defectives. No one, we believe, at aU in contact with reality, 
would venture such a conclusion, however much suggested by tes^ 
results. In any case, actual clinical experience shows that there are 
certain individuals who regularly test as very inferior and frequently 
as mental defectives on standard psychometric tests but who, never- 
theless, are able to ad^^ and ^djust — ^make good housewives, adequate 
breadwinners, useful citizens and can and do get along not only in rural^ 
but urban environments. It is clear that these people are not mentally 
defective as defined by law or common sense. Nevertheless, by the 
tests available at present for measuring intellectual ability they are 
systematically selected as inferior individuals on a par with those who 
are recognized as needing institutional care as mental defectives. 

From facts such as these it seems evident that there is not one,Jbut 
sqyeral kinds of mental deficiency — ^that the concept of mental deficiency 
is far from a simple entity. There are at least two and probably three 
types of mental deficiency. The first is the intellectual defective, diag- 
nosable as such by the usual psychometric tests; the second, the social 
defective for whom the life history of the individual is the most satis- 
factory criterion; and, third — the emotional or ^'moraP’ defective whose 
precise definition is extremely diflScult to give, but whose existence, to 
any one who has had any first hand experience at a large clinic, is an 
observable reality. Between all three there is usually a certain degree 
of correlation, but this correlation is not sufficiently high to make any 
one an unfailing diagnostic indicator of the other. 

® Two such cases are cited in Ch. 9, p. 130 (Case D. W.) and p. 132 (Case M. B.). 



CHAPTER 6 

THE PROBLEM OF MENTAL DETERIORATION 

In speaking of mental deterioration much depends on what one 
means by ^^mentah^ and how one proceeds to define deterioration. 
By mental we shall mean primarily intdlectual abilities^ and by deterio- 
ration any conspicuous falling off or loss in these abilities. Concretely, 
a person will be considered as giving evidence of mental deterioration 
when he is no longer able to carry on his intellectual tasks with the 
speed, accuracy or efficiency previously characteristic of his functioning 
level. The only condition to be added is that the loss must not be one 
which is due merely to lack of practice. 

The definition of mental deterioration which we have just given is 
much broader than what is generally understood by it. Most psy- 
chiatrists and neurologists would not consider a mere falling off or 
loss in general capacity as an indication of mental deterioration. This 
is because they are primarily acquainted with mental impairment when 
associated with some organic or brain injury. But mental decline may 
and does occur independently of any specific mental disea^. It is 
characteristic not only of such conditions as general paresis, cerebral 
arteriosclerosis, or chronic alcoholism but of all senescent decline. 
Nevertheless, for diagnostic convenience, mental deterioration may be 
said to be of two sorts — that which occurs after maturity with the 
natural increase of age, and that which is consequent to some brain 
lesion or prolonged mental disease occurring at any age. Psychiatrists 
and neurologists have been primarily concerned with the latter and 
have almost entirely disregarded what for want of a better term, we 
shall refer to as “normal mental deterioration’'. Psychologically, 
however, there is little difference between the two, except as regards 
the rate at which deterioration occurs and, in the case of traumatic 
injuiy, as regards the number of mental functions involved. The 
deterioration met with in normal old people is similar to that met with 

^ This is an arbitrary delimitation of the term. Emption is no less mental than 
intellect, but the delimitation of the term when used in connection with deteriora- 
tion corresponds to what is generally implied by it in clinical practice. Thus 
psychiatrists distinguish **emotional’^ from **mental,^’ that is to say, * intellec- 
tual'* deterioration. 
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in most organic brain diseases. Senility,^ or extreme mental deteriora- 
tion, is merely a terminal state of certain processes which begin rela- 
tively early in life and continue progressively with age. The net result 
of the accompanying changes is to impair all original endowment. 
Whether we wish to reserve the term mental deterioration to cover 
only the extreme losses of ability or prefer to apply it equally to the 
entire senescent decline is a matter of convenience. For an under- 
standing of the impairment as a whole, however, it is essential that we 
have as full knowledge as possible of all changes which occur in human 
ability with age. It is these changes which we propose to discuss in 
this chapter. 

The changes in human capacities which occur with age fall into two 
self-limiting epochs: (1) The period during which they increase and 
(2) the period during which they fall off with age. The first of these 
is the well known period of growth which need not detain us here. The 
second period is that of gradual decline, regarding which there is still 
much difference of opinion but whose main trends have been sufficiently 
established to warrant a general description. We have elsewhere 
summed up the main facts regarding the decline of capacities with 
age® but for the sake of completeness will briefly summarize them again. 

Every human capacity after attaining a maximum begins an im- 
mediate decline. This decline is at first very slow but after a while 
increases perceptibly. The age at which the maximum is attained 
varies from ability to ability but seldom occurs beyond 30 and in most 
cases somewhere in the early 20 ’s. Once the decline begins it progresses 
continually. Between the ages of 30 and 60 it is more or less linear. 
In the case of most abilities the decline between these ages may be 
described with good approximation by an equation of the first degree. 
These facts are graphically summarized in figure 3. The bold-faced 
curve is the smoothed line of means of Bellevue intelligence test scores 
for subjects from age 15 to 65; the light curve, that for Vital Capacity 
measures for the same age period. Both have been reduced to com- 
parable scale units.** As will be seen the maximum of both curves 

* Senility, unlike “senile psychosis, “ is not a clinical entity. Even the latter 
is seldom used alone. Usually there is some qualifying condition added; thus: 
senile psychosis with arteriosclerosis, senile psychosis with Alzheimer's disease, 
and so on. 

® Wechsler, D.: The Range of Human Capacilies. Chapter VII. 

♦The test-score means were derived from an abbreviated 8-test scale; the 
actual means for the Full Bellevue Scale are given on page 118. Those for Vital 
Capacity will be found in an article by Ruger and Stoessiger, AnnaU of Eugenic^ 
1926, vol. II- , pp. 85 and 104. 
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occurs in the age period 20 to 25, with that of Intelligence somewhat 
earlier than that for Vital (Lung) Capacity. Another point of interest 
is that the Intelligence Test curve declines at a faster rate than the 
curve of Vital Capacity. This may seem strange but is not an artifact. 
Contrary to common belief many of our intellectual abilities show 
greater impairment with age than do our physical ones.^ The actual 
decline in any given case, however, varies with the ability in question. 

The curve of mental decline shown in figure 3* is a composite curve. 
It is composite, first, in the sense that the points used in plotting it 
have been derived from measurements obtained on large groups and 

VARIATION OF INTELLIGENCE SCORES AND VITAL CAPACITY 



not on single individuals. But it is also composite in the sense that 
the decline described represents the age changes in mental capacity, 
not with respect to any specific ability, but with respect to the average 
of a large number of abilities. The groups of abilities here considered 
are those represented by the tests which happen to constitute our 


5 The most conspicuous decline of all is shown in the sensory field. Vision and 
hearing fall off rapidly after 30. Their curves of decline, however, are far from 
straight lines. See on this point Huger and Stoessiger, op. ciL 

* See also figure 2, p. 29. The curve of mental decline shown by the heavy 
line in figure 3 has been * ‘reduced’' to make it comparable to the Vital Capacity 
curve given along side of it. It has also been “smoothed’'. 
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scale. A scale made up of a different set of tests might furnish an age 
curve which differed significantly from the one we obtained. To 
determine whether such were the case one would have to examine the 
same group of individuals with a large variety of scales, each as different 
from the other as possible. We were unable to do this. But, as a 
possible alternative approach, we compared the age curve obtained 
with the Full Scale with age curves obtained from combinations of 
only certain of the tests® and also with age curves obtained from the 
individual tests, taken separately. Individual age curves for six of 
our ten tests are given in figures 4 and 4A. 

As will be seen from their examination, the form of all these curves 
is the same as that of the generalized curve given in figure 3. The 
decline of any given ability, like that of any combination of them, is 
essentially linear. The main difference between them pertains to the 
age at which the initial decline begins and more particularly to the 
rate at which the decline proceeds. Different, mental abilities decline 
at different rates. That they do is psychometrically fortxmate, because 
these differences can be made use of in determining mental deterioration. 

The decline of mental ability with age is part of the general senescent 
proce^ of the organism as a whole. Hitherto the common view has 
been that our mental abilities, unlike our physical abilities, remain 
relatively unimpaired until rather late in life (senility), except as an 
occasional consequence of disease or traumatic injury. This was an 
unsubstantiated hypothesis tenable only so long as no facts were at 
hand to oppose it. But the view still persists even though such facts 
are now available. Most people, including scientists, hate to believe 
that they are not as mentally alert at 50 as they were at 20. Part of 
this is due to a confounding of mental with pr act^ al ability, that is, 
a failure to differentiate between intellectual endowment and the 
success had in applying it. The latter is naturally dependent in no 
small measure upon e xperien ce. What one has lost through a falling 
off of native ability one may often replace by acquired knowledge. 
An old clinician may be a better doctor than a younger one, even 
though he possess less actual understanding of disease processes. 

Another item which contributes to the biased attitude towards the 
facts of mental decline is the historical distinction between physical 
and mental. According to this distinction the latter is conceived of 
being higher, better or more important. Accordingly few people are 

* For example, curves obtained from verbal tests alone, or performance tests 
alone. 
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concerned when told that at 40 they cannot hear or fight as well as 
when they were 20, but are quite “het up” when informed that they 
probably also cannot calculate or reason as well. There also exists 
a kind of hierarchy of relative values as regards the various mental 

VARIATION OF SCORES ON DJFFERENT TESTS 
WITH AGE 



AGE 



abilities themselves. Professor Cattell long ago called attention to 
the fact that people are ever ready to complain about their bad memory, 
but seldom of their poor judgment or common sense. But it is certain 
that memory is not the only mental capacity which dechnes \\dth age, 
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nor the one which is always most impaired by it. Conrad and Jones, ^ 
for example, found that older people (individuals over 50) did much 
better on the Army Alpha Test calling for general information, than on 
the one calling for abstract reasoning (the Analogies Test), and we our- 
selves have found that they do better on repeating digits forwards than 
in detecting similarities. 

We have put forward the hypothesis that the decline of mental 
ability with age is part of the general organic process w^hich constitutes 
the universal phenomenon of senescence, and have insisted upon the fact 
that the phenomenon begins relatively early in life. The evidence we 
have adduced for this hypothesis is the observed parallelism we have 
found in the rate of decline of various ph 3 ^sical and mental abilities. 
Another line of evidence is furnished by neurological studies of the 
brains of senile individuals. These are not always clear-cut cases 
because they may be associated with special trauma or organic disease. 
If our hypothesis is correct, however, the same changes, though of 
course to a lesser degree, ought to be expected much earlier in life, say 
from 20 years onward. Up to the present, experimental data supporting 
this view have been lacking. This may be due to the fact that neuron- 
pathology has not been particularly alive to the problem, or, as is more 
probable, to the fact that neurological techniques are not sufficiently 
advanced to detect very small and gradual alterations^ which may, and 
xmdoubtedly do, occur in brain tissue. There is, however, some in- 
direct evidence that alteration of the brain begins at an early age, 
which may be considered as supporting our view. We refer to the 
progressive change in brain weight with age. 

It has been known for some time® that the mean weight of the adult 
brain declines with age. The skull thickens and the brain shrinks. 
It we accept the brain as the organ of the mind, it is only reasonable 
to assume that even gross changes such as alterations of weight may also 
affect its function. Assuming that to be the case, we should expect 
that the changes in brain weight show some concomitance with altera- 
tions in general intellectual ability , — a fact which we thought might be 
demonstrated by an analysis of both in relation to the age factor. 
The graphs in figure 5 represent the results of such a study. They 

^ Genet. Psychol. Monog. 1933, pp, 260-261. 

« Probably because most of these are atomic or molecular and therefore not 
discernible under the microscope. 

* Pearl, R.: Variations and correlation in brain weight. BioTneL, 1905, voL 
IV, pp. 13-104. 
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are the age curves for intellectual ability and brain weight for ages 15 
to 65. Necessarily they were not obtained from the same subjects.^ 
The age curve for intellectual ability is that obtained for subjects 
given the tests which were used in standardizing our Bellevue Scale; 
the brain weight curve is that computed from figures for brain weights 
of autopsied subjects of comparable ages, collected by Rossle and 
Routlet.^*^ For purposes of comparison, the curves have been reduced 
to approximately the same scale. Inspection of these curves reveals a 
close parallelism between loss in brain weight and decline in mental 

VARIATION OF INTELLIGENCE TEST SCORES AND 
BRAIN WEIGHTS WITH AGE 



Fig. 5 

ability with age. It is probable that, with a different series of brain 
weights and a different series of tests, the similarity between the two 
curves would not be so marked. But the general parallelism caimot 
be ascribed either to chance or artifact. It is much better than that 
found between most physical and mental traits and as good as that 
foimd between curves for many mental abilities taken separately. 

Data given in Wertham^s, F.: The Brain as an Organ. New York, 1934. 
The brain weights are those of males 16 years and over, averaged for five year 
periods. 
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The decline in the weight of brain, like that in intellectual ability, is 
essentially linear.^^ After reaching a maximum, at about age 20, the 
brain begins to decline in weight, somewhat irregularly at first, but 
after 25 or 30 at a fairly constant rate. 

If our generalizations regarding the influence of age on mental de- 
terioration are correct, it follows that mental deterioration in human 
beings begins at a relatively early age. However, to speak of a person 
in his early 30 ^s as showing signs of deterioration because he already 
manifests some measurable decline of ability, would be stretching the 
term beyond its usual connotation. Clearly that is not what is implied 
by deterioration in the ordinary sense. It would seem desirable to re- 
strict the term to only such impairment or losses in ability as are 
significantly greater than those due to the age factor alone. To do 
this we must necessarily know what the n orm al loss of ability is for the 
average individual and the mean limits of variability at all ages for 
the normal population. But even with these facts at hand we are only 
at the beginning of our task. To evaluate deterioration we must be 
able to measure it quantitativel 3 ^ 

The measurement of mental deterioration involves three separate 
problems: (1) The reliable measurement of the individuaFs actual or 
present functioning ability; (2) the evaluation of his previous fimc- 
tioning level; (3) the expression of the difference between the two in 
meaningful, quantitative terms. Up to the present, none of the three 
problems has had complete solution. We say none, although as regards 
the measurement of present functioning ability this statement is not 
altogether correct. We do have considerable means of measuring an 
individuaFs functioning ability, but these means, in general, are un- 
suitable for precisely those groups on whom they are most needed 
in studying mental deterioration, namely, the older adults- What 
we have said regarding the inadequacy of available general intelligence 
scales for the measurement of adult intelligence, holds equally for 
individual tests of mental ability. They have seldom been standardized 
for adults. The Bellevue Scales presented in this book are an attempt 
to furnish the necessaiy adult norms for such tests. By means of the 
tests which compose them we are now able to measure a fairly large 
number of intellectual abilities throughout the greater part of adult 
life, that is, up to the age qf 60, and in some instances up to the age of 70. 

The second step in the nieasurement of deterioration, namely, the 
evaluation of an individuaFs previous functioning level, presents a 

For further evidence on this point see Pearl, R. : SttLdies in Human Biology. 
Baltimore, 1924, pp. 58-65, 
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much more dijEcult problem. The reason is that in most cases we have 
no psychometric data by which such evaluation can be made. Few 
persons examined for mental deterioration have ever had a previous 
psychological examination. Yet, in order to make an accurate estimate 
of a person’s normal functioning ability, we should have not only one 
but a series of ps^^'chometric examinations on him, done at various 
intervals and so far as possible with the same or comparable tests. 
That, however, is an ideal situation which cannot be realized at present. 

In practice, it is necessary to turn to other sources -of data that will 
enable us to appraise the subject’s previous functioning ability. The 
data usually consist of facts gleaned from the subject’s educational, 
vocational and social history. Thus, if from a subject’s history we 
learn that he is a high school graduate, that he has held responsible 
positions for a number of years and that he attained some social recogni- 
tion in his community, we can safely assume that he must have been a 
person of at least average endowment and that he should be able to 
perform certain mental operations which may be expected of individuals 
of comparable endowment. But information of this sort is of value only 
where the discrepancy between an individual’s actual and expected 
functioning is considerable. Thus, if a formerly successful business man 
at age 45 attains an I.Q. of only 70, is unable to do simple calculations 
and can no longer repeat 5 digits, we should know at once that he has 
deteriorated. Degrees of deterioration of this sort, however, can ordi- 
narily be detected without standardized psychometric examination. 

We do not need tests to discover deterioration in an old paretic. 
The merit of psychometric tests is that they can show us small differ- 
ences of ability, and their usefulness in the clinical field depends upon 
how well they can do this. Their value in determining mental deteriora- 
tion must reside in their ability to detect changes in mental functioning 
long before thej^ have so disorganized the behavior of the individual 
as to make them patent to all. For such purposes a general social and 
psychiatric history is insufi&cient as a base for comparison. We must 
either have psychometric records of previous examinations with the 
same or similar tests, or be able to use the results of the mental tests 
given at any specific examination as a means of inferring the probable 
previous ability of the individual. However, as already mentioned, 
few adults who come up for psychometric examination have had previous 
mental tests, and even fewer were done sufficiently long ago to enable us 
to make a comparison between their present and their former function- 
ing. It is here that the age curves for separate abilities which we give 
in figures 4 and 4A are of value. 
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One of the interesting facts revealed by the age curves for different 
abilities obtained on the same groups of individuals, is that certain 
abilities decline more slowly with age than others. Thus, the abilities 
called for by the General Information and General Comprehension 
Tests hold up much better than do the abilities called for by the Sub- 
stitution and the Memory Span for Digits Tests. This difference in 
rate of decline of various abilities suggests a possibility of estimating 
previous functioning levels. Thus, if the abilities which do not decline 
significantly with age vrere precisely those which were least affected 
by the deteriorative process, one could assume that scores which in- 
dividuals attain on tests measuring these abilities represented their 
original or permanent endowment. If now we combined a number of 
these tests into a rounded scale, such a scale would furnish us a means of 
measuring mental deterioration as well as past functioning levels in 
terms of present functioning abilities. All that would be necessary 
w^ould be to compare the mean score which a subject attained on the 
tests which remain relatively unimpaired with age, with the rating which 
he obtains on a group of tests which are relatively much impaired with 
age. The ratio or difference between their rates of decline would give 
us the required measure. 

The method outlined above for comparing previous and present 
functioning ability in terms of test scores obtained in a single examina- 
tion of a subject may be termed the differential-iest-score method of 
measuring mental deterioration.'^ ^ have just seen, it makes use 
of the fact that some abilities decline relatively little during adult life 
and others to a considerable degree, and assumes that the difference 
between their rates of decline in any given individual expresses his 
relative degree of deterioration. The usefulness of the differential- 
test-score method for evaluating deterioration necessarily depends 
upon the availability of tests with full age norms. Ideally, we should 
have to have available age curves for many different abilities, each 
measured in as many ways as possible ^vith tests whose validity and 
effectiveness had been previously established. At present such tests 
are all too few, but the eleven subtests made available by the stand- 
ardization of the Wechsler-Bellevue offers the possibility of trying out 
the method just discussed. 

Some attempt to use methods like the one just outlined has already been 
made by a number of psychologists. (Babcock, H., Arch, of Psychol. j 1930, 
no. 117; Gilbert, J. D., Arch, of Psychol.^ 1935, no. 188; Shipley, W.C.,J.Psyc/M)h, 
1940, 9, 371-377; Hunt, H., J. Aypl. Psychol., 1943, 5, 375-387.) 
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The first step in the application of the differential test score method 
is the optimal allocation of tests to the “Hold’^ vs. ‘^Don^t Hold'’ groups. 
In general, the tests which drop most markedly with age belong, as 
might be expected, vith the ^^Don't Hold" and those which drop least, 
with the “Hold" tests. But another factor must also be considered, 
namely, the type of ability measured by the tests. There must be some 
fxmctional similarity betw^een the opposed or contrasted tests. If this 
is not the case, w’e risk making test combinations w-hich, though showing 
significant differences, are likely to give spurious discriminants. For 
example, most abilities tapped by performance tests decline much more 
rapidly with age than those involved in verbal tests. At first thought 
it might appear that this situation could be capitalized by using verbal 
tests against performance tests as a ready means of obtaining measures 
of “Hold" vs. “Don’t Hold" abilities. This, howwer, would be a mis- 
take because, if we accepted differences between the scores on verbal 
and performance tests as a criterion of deterioration, all individuals 
who have relatively good verbal capacity would inevitably show greater 
mental deterioration as they grew older. This w^ould follow from the 
fact that the differences between verbal and performance abilities au- 
tomatically increase with age. On the other hand, people who are 
relatively good in performance tests would show lesser deterioration or 
indeed none at all, because the discrepancy between their verbal and 
I)erformance scores w^ould become smaller and smaller as they grew older. 
Accordingly, in assembling our “Hold" vs. “Don’t Hold" batteries we 
have brought together approximately the same number of verbal as 
performance tests. The tests combined in each of the proposed cate- 
gories are as follows: 

Tests which hold up with age 
Information 
Comprehension 
Object Assembly 
Picture Completion 
Vocabulary 

Tests which do not hold up with age 
Digit Span 
Arithmetic 
Digit Symbol 
Block Design 
Similarities 

(Picture Arrangement) 
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To obtain a measure of deterioration, one compares the sum weighted 
scores of the “Hold’’ tests with that of the “Don’t Hold” tests, allowing 
for the difference in number of tests in each group, or for a more strict 
comparison the sum of the first four tests of the “Hold” with the first 
four tests of the “Don’t Hold” group. The resulting comparison may be 
expressed either as a ratio or difference between the two sums. Nat- 
urally, if the result is given as a difference it must be expressed as a 
percent difference in order to take into account the absolute magnitude 
of the sums compared. Thus, if the sum of a subject’s “Hold” subtest 
scores is 50 and the sum of his “Don’t Hold” subtest scores 40, he shows 
a deterioration loss of 20% and efficiency quotient of .80. Deterioration 
is indicated if the percent loss is considerable or the efficiency quotient 
low, that is considerably below 100. 

The problem, of course, is to define what is meant by considerable 
or significant loss. In the long run such a definition would have to be 
statistical in character and be based on a distribution of individual per- 
cents of loss or calculated deterioration quotients of a sufficiently large 
number of cases both normal and pathological. These data are not yet 
available, but it is possible to arrive at effective approximations by the 
use of tables 39 and 40 given in appendix 2. This table contains the 
mean scores of all the subtests at successive age periods. Summing the 
subtest scores of the tests composing each of the batteries compared, 
we can obtain the total weighted score that may be expected for the 
“Hold” and “Don’t Hold” batteries at different ages. Next, it only 
remains to treat the resulting sums in the manner described above, and 
obtain what may be termed “normal” or average deterioration expect- 
ancies. These are given in tables 6 and 6A. 

Table 6 was obtained as follows: at each age level sums were obtained 
of the mean test scores of the Information, Comprehension (or Vocab- 
ulary^^), Picture Completion and Object Assembly tests, (Hold tests), 
and of the Digit Span, Arithmetic, Block design and Digit Symbol tests, 
respectively. The sum of the “Don’t Hold” tests at each level was then 
subtracted from the sum of the “Hold” tests and the difference divided 
by the minuend. The result is expected normal percent deterioration 
loss. If instead of expressing deterioration in terms of a difference one 
wishes to express it as a ratio the sum of the “Don’t Hold” tests is 

In practice it will be found safer to use the Vocabulary than the Comprehen- 
sion test. 
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divided directly by the sum of the “Hold/’ and the results so obtained 
constitute deterioration quotients. These are given in table 6A. 

We may now define deterioration in the pathological sense: An indi- 
vidual may be said to show signs of possible deterioration if he sho'ws a 
greater than a 10 percent loss, and of definite deterioration if a loss 
gi'eater than 20 percent^^ than that allowed for by the normal decline 
with age. Obviously the larger the loss the greater the probability that 
we are dealing with true deterioration. 


TABLE 6 

Average {Normal) Deterioration Loss at Different Ages 
(In percent) 



20-24 

25-29 

30-34 

35-39 1 

40-44 

45-49 

50-54 

55-59 

Calculated Values 

0.5 

0 

4 

i 

4 i 

9 

11 

14 

16 

Smoothed Values 

0 

1 

3 

5 

8 

11 

14 

16 


TABLE 6A 


Average {Normal) Deterioration Quotients at Different Ages 


AGES 


pon’thold -j- hold X 100 



20-24 ! 

25-29 

30-34 

35-39 

40-44 

45-49 

50-54 ; 

55-59 

j 

Calculated Values i 

99.5 

100 

96 

96 

91 

89 

86 ! 

84 

Smoothed Values 

100 

99 

97 

95 

92 

89 

86 ; 

84 


We shall now present a few applications of the procedure described 
above, restricting ourselves to the percent of loss technique as the pre- 
ferred method. 

On page 161 is cited a case of a man age 54 with symptoms of head- 
ache, dizziness and history of an old skuU fracture. The case was used 
to illustrate the organic brain impairment pattern. But the psycho- 
metric also serves to illustrate what is not so obvious, a test score distri- 
bution indicating marked mental deterioration. This is not suggested 
by the subject’s test performance as a whole, since his I.Q. is 103 and 

Losses of 10 and 20 percent correspond roughly to deviations of —1 P.E. and 
—2 P.E. from the mean at age 20-25. The mean loss at this age is, of course, 0. 
The equivalents at other ages may be assumed to be roughly of the same order, 
but the percent of loss must of course be calculated from the altered means at 
these ages (see table 6). 
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shows no discrepancy between his total Verbal and total Performance 
ratings. If, however, we compare his scores on the *^Hold^’ and “Don’t 
Hold” tests we find the following: Information — 11, Comprehension — 
10, Object Assembly-12, Picture Completion- 10, Arithmetic-7, Digit 
Span-6, Block Design-7. The sum of the “Hold” tests is 43, that of the 
“Don’t Hold” 24; from which by appl3dng the method outlined above, the 


/43 — 24 \ 

deterioration loss f — — — j, is 


is found to be 44 percent or, allowance 

being made for his normal decline with age (see table 6), a net loss of 30 
percent. This is obviously considerable ; actually, the subject was unable 
to continue at his job. The method just described will be most useful 
in deterrnining mental deterioration in organic brain cases, but it may 
also be applied in cases of Schizophrenia. Here special considerations 
arising from the peculiarities in test patterning of this group must be 
borne in mind. Thus, the not uncommon low^ score of the Schizophrenic 
on the Comprehension and still lower ones on the Picture Completion 
test (see Case S-2); or, again, the fact that schizophrenic patients do 
systematically worse on the Object Assembly than on the Block De- 
sign, and so on. 

Another possible application of the method is in the appraisal of the 
reported effects of shock treatment, negative as well as positive. Here 
in addition to some of the considerations already mentioned, there is 
also the problem of practice effect inherent in all retest procedures, and 
in particular its differential effect on the various tests used. But per- 
haps the most serious limitation to the method as a whole is the assumed 
very high reliability of the tests themselves. Such high order of re- 
liability does not in fact exist. For some of the tests, such as the Vo- 
cabulary, it is reassuringly high and perhaps sufficient, but for most of 
the others no such claim can be made. Nevertheless the method does 
prove effective in practice, and if the limitations discussed are borne in 
mind, may be applied wdth reasonable confidence. 

A particular point worth calling attention to is the frequency wnth 
which certain function s are involved in the deterioration of mentally ill 
patients irrespective of the disease entity. Thus alcoholics (Korsakoff 
type), paretics, seniles and schizophrenics all do badly on Digits Back- 
wards, Hard Associations, Reproducing Designs and the Digit Symbol 
Test. They do not, however, do so for the same reason. The paretic 
and alcoholic fail on repetition of Digits Backwards and on the Digit 
Symbol Test because of definite retention and learning disabilities; the 
dementia praecox patient because of poor attention, self-preoccupation 
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and general inability to concentrate,^^ Nevertheless, from certain 
practical points of view, the disabilities may amount to the same thing. 
Both types of patients cannot be trusted to execute simple tasks or to 
follow complicated directions. 

This brings up again the interesting question of the nature of the 
deteriorative processes. There is no doubt that, as a result of certain 
traximatic injuries, individuals show special disabilities. But the 
recurrence of like disabilities in different traumatic injuries strongly 
suggest the generalized nature of the deteriorative process. The 
brain acts as a whole and its impairment in one place affects its func- 
tioning^ as a whore. The fact that its disorganization is greater along 
one rather than another line is of secondary importance. For this 
reason it is perhaps not particularly important what tests are used in 
detecting it, providing they are such tests as are, especially sensitive 
to the patient’s intellectual functioning. 

In general, mental deterioration is best revealed by measuring speed 
of response, learning and the ability to perceive new configurations, 
particularly spacial ones. Ordinarily, it is not the form of the test 
but the function it calls forth that determines its diagnostic value. 
One can measure speed of response almost equally ■well by counting 
the number of wmrds the subject gives in three minutes, or by the num- 
ber of A’s he can cancel on a printed page in a like period. But it makes 
considerable difference when we study learning ability, whether we use 
“old’' or “new” paired associates. Chronic alcoholics, for example, 
tested with a list of paired associates such as come — go and Boston — 
Massachitsetts show little impairment, but when tested with a list of 
hard paired associates like crush — dark and Appleby — Texas will show 
considerable impairment. The reason is that while the test is of the 
same form, the functions involved are quite different. One requires 
the formation of new, the other merely the recall of old associations. 

We have defined deterioration primarily as an impairment or loss of 
function. Sometimes this impairment manifests itself as a disorganiza- 
tion, sometimes as a mere regression to primitive types of response. 
Both occur in various types of organic brain disease. The disturbances 
met with in the visual motor field are particularly striking and have 
been extensively studied by Lauretta Bender by use of the Wertheimer 

This is the usual psychiatric interpretation. For a very different explana- 
tion, see Babcock, H. : Dementia Praecox. N. Y,, 1935. 
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figures which she has standardized into a visual-motor Gestalt test.^® 
We have applied her test to patients variously diagnosed as seniles, 
and found that the figures are reproduced by these patients as primitive 
loops or segments of large arcs, perseverated in a wavelike manner.^^ 
This is precisely the kind of thing that Bender found to be characteristic 
of children’s efforts prior to the maturation of the psychomotor func- 
tion, (at M.A. level of approximately 2 years) and in particular of men- 
tally defective children of about this M.A. level. Veritably we must 
add another stage of Shakespeare’s Seven Ages, the age of just plain 
sans everything. 

Bender, L. : The Visual-motor Gestalt Test and Its Clinical Use. Research 
Monographs f Amer. Ortho. Psych. Assoc., New York, 1938, particularly Chapter 
VIII. 

It is interesting to note that these changes are precisely those observed in 
patients with severe sensory aphasia. Seniles are, of course, frequently aphasic. 
Cf. Bender, L.; Arch, of Neur. & Psych., 1933, XXX, 514-537. 
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THE BELLEVUE INTELLIGENCE SCALES 




CHAPTER 7 

SELECTION AND DESCRIPTION OF TESTS 


The first problem that confronts any one attempting to devise an 
intelligence scale is that of deciding upon the tests that should be in- 
cluded in the battery. This task is not a simple one, for, in addition 
to the necessity for fulfilling certain statistical criteria, there are a 
number of general considerations, which, independent of all other 
factors, restrict one^s choice to a greater or lesser degree. One of these 
is the author’s defined or implied view as to the nature of intelligence. 
Thus, if he believes that intelligence involves primarily the ability to 
perceive logical relations and to use symbols, he is very likely to favor 
tests calling Tor verbal, arithmetical and, in general, abstract reasoning 
abilities. If he believes intelligence also involves abilities to handle 
^‘practical situations”, he is very likely to include at least some tests 
calling for performance and manipulative abilities. 

The choice of tests is further restricted by the special requirements 
of the various types of scales themselves. Certain tests, for example, 
which are suitable for age scales cannot be used satisfactorily for point 
scales and vice versa. Thus items involving psychomotor ability, 
such as tying a bowknot and copying a diamond, are found excellent 
for age scales, but are almost useless for continuous point scales not 
only because of their limited range but by virtue of the fact that by 
increasing their complexity one alters the type of ability measured by 
the tests. For example, at age seven, ability to copy a diamond is a 
very good indication of the child’s intelligence, but if we increase the 
complexity of this task, say by demanding the reproduction of a bi- 
sected rhomboid, we succeed in making the test more difficult but add 
little to its discriminative value, as a test of intelligence. And if we 
proceed to make the task still more difficult, we are likely to wind up 
with a test that measures primarily some specialized or even “new” 
ability. In this connection, it may be noted that the same difficulty 
inheres in all tests to a greater or lesser degree. Beyond certain points 
every test ceases to be an effective measure of the capacity which it 
was originally designed to measure either because superadded factors 
begin to enter into the relationship or because the curve of its measured 
function tends to reach an asymptotic level. Thus memory span for 
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digits correlates with general intelligence up to a certain point, but 
beyond the ability to repeat 6 or 7 digits forward, its correlation becomes 
practically negligible. On the other hand, many performance tests 
(for example, the Ship Test) cease to be a measure of intelligence at 
about the 12 year level, because above this age, maximum scores are 
obtained by most subjects. 

What is true for tests taken individually is true of them when com- 
bined into scales. Every mental scale eventually reaches a point or 
level beyond which increasing scores on it show relatively little correla- 
tion with intelligence as originally conceived. That is inevitable. 
But, of course, different scales may, and do, differ considerably as 
regards the location of the point where they cease to be effective measures 
of intelligence. Naturally, every author of a scale seeks to extend this 
limit as far as possible, but if the range of the scale is at all wide this 
desire to extend its limits entails serious restrictions in the choice of 
material. For it is unfortunately true that those tests which dis- 
criminate well at low’er levels of intelligence generally do not do so at 
upper levels, and vice versa. It is for this reason, among others, that 
our scale does not go below the seventh year. We were unable to 
find any battery of tests which would discriminate below that year and 
still be valid for use with adults. 

Yet another factor limiting the choice of possible test material is the 
degree to which one is ready to bow to the dicta ofjexperience. It may 
seem like insisting on the obvious to assert that the worthw'hi’eness of a 
test can be determined only by the way it shows up when actually tried 
out. It is surprising, however, how many tests that form part and 
parcel of most scales turn out to be poor indicators of intelligence when 
checked against clinical experience. A long list of such tests could 
readily be made. The thing that first calls onc^s attention to the 
possible inadequacy of these tests is that one often finds stupid people 
doing well and superior people not infrequently failing on them. These 
include such tasks as giving rhymes, checking letters, making designs, 
defining unusual words, repeating long series of numbers, putting 
together formboards, to mention only a few of the t3pes of tasks fre- 
quently offered as tests of intelligence. It is the inclusion of items like 
these in our scales that often arouses the scepticism of the laity towards 
intelligence tests as a whole. This general scepticism is unwarranted, 
but it is unquestionably true that the purely statistical criterion of 
passing or failing an item at a given age level, is far from sufificient 
ground for accepting it as a valid measure of intelligence. Thus the 
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fact that approximately as large a percentage of adults succeed in 
repeating 8 digits forward as in solving the ingenuity problem of year 
XVIII of the Stanford-Binet in no way proves that they are equally 
good tests of intelligence. In 15 years of continuous testing, we have 
not found a single person of inferior intelligence who has succeeded on 
the latter, though we have seen scores of mental defectives passing the 
former; and most clinicians, we believe, can testify to the truth of this 
observation. 

The point we have been making is that all tests, in the long run, must 
be subjected to some effective experimental check-up, or practical 
criterion of validation. This pragmatic criterion of practical validity 
seems to have been insufficiently emphasized by the makers of test 
scales. Altogether too many items used in most intelligence scales 
have been included on the basis of purely statistical and a priori grounds. 
Unfortunately, this method of choosing test items is far from infallible. 
The only way one can know whether a test item is a really “good"' 
measure of intelligence is by actually trying it out. 

A priori prognostication is a very risky business, particularly with 
individual test items, as the following, from our own experience, shows. 
In collecting the items to be used for our Information Test, we included 
the questions, “How many pints make a quart?’^ and “How many feet 
make a yard?^^ as possible items for the lower end of the series. Both 
are questions commonly met with in various information tests, es- 
pecially in the unstandardized ones of psychiatrists. Seemingly a 
knowledge of the correct reply is regarded as having some value by 
those who use these questions. The reader may agree or disagree as to 
their value, but it is unlikely that if he were asked to pass judgment on 
their respective merit, that he would rate either question very much 
better or very much worse than the other. In this, however, he would 
be mistaken. We submitted these two questions to several groups of 
known intelligence level and calculated the frequency at which each 
was passed by the several groups. Our data then showed that while 
the question, “How many pints make a quart?'' was fairly good in the 
sense that it gave significant, discriminating differences between mental 
defectiveness, borderline and dull-normal individuals, the question, 
“How many feet make a yard?" w^as practically worthless, because the 
differences were small and negligible. Certainly no one would have 
predicted this result in advance. Now that we have discovered it, 
we must frankly admit we can offer no good reason for the fact. 

The point of our illustration is that validation of any intelligence 
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scale should be pragmatic. The scale must be not only statistically 
reliable, but discriminative with respect to the very thing we propose 
to measure (in our case, an intelligence level); and its validity can be 
determined only through actual experience. Ideally, the pragmatic 
test should be made, not only for the scale as a whole, but for every item 
that enters into it. In reality no intelligence scale (including our own) 
fully measures up to this requirement. We think, however, that the 
tests that constitute our scales conform more nearly to this ideal than 
do those of any adult scale now available. One reason we have been 
able to do this is that we have drawn so heavily on the experience of 
others. Our aim was not to produce a set of brand new tests but to 
select, from whatever source available, such a combination of them as 
would best meet the requirements of an effective adult scale. 

In arriving at our final choice of tests we pursued the follovring pro- 
cedure; (1) A careful analysis vras made of the various standardized 
tests of intelligence now in use. These were studied with special atten- 
tion to authors’ comments with reference to the type of functions 
measured, the character of the population on which the scales were 
originally standardized, and the evidence of the test’s reliability. (2) 
An attempt was made to evaluate each test’s claim to validity as 
evidenced by its degree of correlation (a) with other recognized tests 
and, (6) more important still, with subjective ratings of intelligence. 
The last named included teachers’ estimates, ratings by army officers 
(as in the case of the Army Alpha and Beta), and estimates of businep 
executives (in the case of various tests which had been tried out in 
industry) . (3) An attempt was made to rate the tests on the basis both 

of our own clinical experience and of that of others. (4) Some two 
years were devoted to the preliminary experimental work of trying out 
various likely tests on several groups of known intelligence level.^ 

On the basis of the data obtained from the above procedure, we 
selected 12 tests, the 11 presently to be described and the Cube Analy- 
sis.* These were given to the various populations to be described in 
the next chapter, and they form the basis of our several scales. The 
Cube Analysis test was discarded after being given to over 1,000 sub- 
jects because it showed lai^e sex differences, proved difficult to get 
across to subjects of inferior intelligence, and because it tapered off 
abruptly at the upper levels. On the other hand, the Vocabulary Test 
was not added imtil a major portion of our subjects had already been 

‘ Groups of 25 to 50 on whom we had intelligence ratings. 

*Test 3 of the Army Beta. 
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examined, and for this reason was used for some time only as an al- 
ternate test. 

The tests finally chosen were: 

1. An Information Test 

2. A General Comprehension Test 

3. A Combined Memory Span Test for Digits 

Forwards and Backwards 

4. A Similarities Test 

5. An Arithmetical Reasoning Test 

6. A Picture Arrangement Test 

7. A Picture Completion Test 

8. A Block Design Test 

9. An Object Assembly Test 

10. A Digit Symbol Test 
Alternate — ^A Vocabulary Test 

These tests have been combined to form four separate but interrelated 
Intelligence Scales, as follows: 

I. The main Individtuil Adidt Examination for ages 16 to 60, consisting 
of the first ten tests listed, but permitting a reduction of the number 
to as few as seven, depending upon their suitability to the subject. 

II. An Adolescent Scale for ages 10 to 16, consisting of the same tests but 
separately standardized. 

III. A Performance Scale consisting of five tests (tests 6 to 10 inclusive). 
TV. A Verbal Scale consisting of five or six tests (test 1 to 5 and the Vocabu- 
lary as alternate). 

By the ‘^Full Scale” is meant either Scale I or Scale II, each of which 
consists of ten tests, plus the Vocabulary as an alternate. We shall 
now proceed to describe the individual tests which enter into any and 
all of the scales listed. 


INFORMATION TEST 

Questions formulated to tap the subject's range of information have, 
for a long time, been the stock in trade of psychiatric examinations, 
and prior to the introduction of standardized intelligence tests they 
were widely used by psychiatrists in estimating the intellectual level of 
patients. Psychologists, however, have generally been inclined to ex- 
clude rather than to make use of information items when devising 
intelligence scales. It was not until the development of the group test 
that such items found their way into the standardized intelligence 
examinations. It is probable, too, that their use here was largely in- 
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Spired by practical considerations, such as the relative ease with which 
they lend themselves to scoring, rather than to any strong faith which 
psychologists may have had in the information tests as a good measure 
of intelligence. One had always to meet the obvious objection that 
the amount of knowledge which a person possesses depends in no small 
degree upon his education and cultural opportunities. The objection 
is a valid one, but our experience with the test has shown that it need 
not necessarily be a fatal or even a serious one. Much depends upon 
the kind of knowledge demanded of the subject and the type of question 
used in eliciting it. 

The first strong support for range-of-information as a good measure 
of intelligence was furnished by the data obtained from the Army Alpha 
Examination. When the individual tests of the Army Alpha battery 
were analyzed with regard to their correlation with various estimates of 
intelligence, the information test, to the great surprise of many, turned 
out to be one of the best of the entire series. It correlated, for example, 
much better with the total score than did the Arithmetical Reasoning, 
the test of Disarranged Sentences, and even the Analogies Test, all of 
which had generally been considered much better tests of intelligence. 
Compared wdth the other tests on Alpha, the Information Test gave a 
much better distribution curve, showed a relatively small percentage of 
zero scores, and showed little tendency towards piling up maximal scores 
at the upper end. All this could not have been an accident, particularly 
in view of the fact that the individual items on the Alpha Information 
Test left much to be desired. The fact is, all objections allowed for, 
the range of a man’s knowledge is generally a very good indication of 
his intellectual capacity. 

In practice, the value of an information test will depend in a large 
measure on the actual items which are included in it. There are no 
universal principles which can serve as unfailing guides to '%ood” ques- 
tions. In general, the items should call for the sort of knowledge that 
an average individual with average opportunity may be able to acquire 
for himself. Thus ^‘What is the height of the average A^ierican 
woman?"’ is a much better question than, ''What state produces the 
most gold?” "How far is it from New York to Paris?” much better 
than "What is the distance from the earth to the sun?”, etc. In 
general too specialized and academic knowledge is best avoided. 
"What does ensilage mean?” and "What is a tetrahedronV^ are poor 
questions even for upper levels of intelligence. So are historical dates, 
names of famous people, whether of statesmen or movie actresses. But 
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there are many exceptions to the rule, and in the long run each item 
must be tried out separately. 

The twenty-five questions on our list are the final selection from some 
75 which we tried out originally. The method employed in choosing 
the items was to present the questions, generally in sets of 25 to 30, to 
groups of individuals of known intelligence level.® Selection of the 
items was then made on the basis of the incidence of successes and 
failures among the various groups. A question was held to be a “good’^ 
one if it showed increasing frequency of success with higher intellectual 
level. Of course, not all questions were equally discriminative at all 
levels. Thus the question, ‘Where is London?'’ discriminates well 
between mental defectives and the borderline group and not at all 
between the average and superior. On the other hand, “What is the 
Koran?” does not discriminate at all at the lower levels (since practi- 
cally every individual there failed it), but showed quite significant 
differences between the respective per cent of average and superior ^ 
individuals who passed it. 

The order in which the questions are listed roughly approximates 
their order of diflSculty for our sample of the population. No doubt, in 
different localities, the order will be somewhat different; it will also be 
affected to some extent by differences of national origin. “Where is 
Rome?” will be passed almost universally by persons of Italian origin 
irrespective of their intellectual ability. More interesting than these 
expected variations are some very surprising results with respect to 
the difficulty of certain of the items. The question, “W^hat is the popu- 
lation of the United States?” turns out to be inexplicably hard. It is 
surprising how many native Americans do not know even the approxi- 
mate number of inhabitants of their own country. We have had 
estimates by college graduates from 10 to 300 million. On the other 
hand, more people can tell what a thermometer is than how many weeks 
there are in a year, and more can give the name of the inventor of the 
aeroplane than can give the author of Hamlet. 

We shall now quote some comments about the test made by our 
examiners in the field, which will indicate some of the test's advantagea 
and some of its limitations. “The test is of value because it gives the 
subject's general range of information”; “It often indicates the alertness 
of the person towards the world about him”; “It may reflect the social 
circle a person comes from; children from educated and intellectual 

* Individuals for whom we had I.Q.’s or other intelligence ratings. 
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families more often give the correct answer to the question, ‘Who wrote 
HamletV “It presupposes a normal or average opportunity to receive 
verbal information. It is a poor test for those deprived of such oppor- 
tunity as well as for those who have a foreign language handicap.’’ 

AJtogether, the Information Test has proved one of the most satis- 
factory of our battery. It declines least with age and correlates second 
best with total score, (“r” = .67 ±.020).'^ Interestingly enough, it does 
not correlate as highly with rote memory (Digit Span) as it does with 
some of the purely performance tests, for example, the Block Design. 

COMPREHENSION TEST 

Tests of general comprehension have long been favorites with authors 
of scales, and our own results more than justify this popularity. Gen- 
eral comprehension questions are to be found in the original Binet as 
well as in all of its revisions. They occur also in many group examina- 
tions, such as the Army Alpha and the National Intelligence Tests. 
The test as it appears on the individual and group examinations, how- 
ever, cannot be said to be equivalent. One important difference is 
that on the group test the subject is asked merely to select one of a num- 
ber of possible answers furnished him by the examiner. On the test 
given individually, the subject must furnish his own answer to the 
questions. The latter way of giving the test not only reduces the 
probability of the subject hitting upon a correct answer through sheer 
chance, but also enables the examiner to evaluate the subject’s response 
even when it is incorrect. Indeed, one of the most gratifying things 
about the general comprehension test, when given orally, is the rich 
clinical data which it furnishes us about the subject. It is frequently 
of value in diagnosing psychopathic personahties, sometimes suggests 
the presence of schizophrenic trends (as revealed by perverse and bizarre 
responses) and almost always teUs us something about the subject’s 
social and cultural background. The variety of replies one gets to 
such a question as “What would you do if you found a letter that was 
already sealed, stamped and addressed”, or “Why does the state require 
people to get a marriage license?” is far greater than one would suspect, 
certainly far greater than an examiner could include in a multiple choice 
questionnaire. The following are sample replies to the former: “Bring 
it to the man’s house”. “Leave it there”. “Open it and see if there is 
any money in it”.® And here are some answers to the marriage ques- 

* Inter-test correlations for all tests are given in Appendix 

‘ The first of these answers was given by a simple defective; the' second, by a 
delinquent; the third, by a psychopath. 
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tion: *^To prevent bigamy’’; ^Tor census purposes’ ‘To protect the 
morals of the community”; ‘To protect the honor of womanhood”; 
“So people will know they are married”. 

The twelve questions which form our test list were selected from some 
thirty in a manner similar to that employed for reducing the number of 
items on our information test. A few of the questions will be recognized 
as coming either directly from the Army Alpha or, in modified form, 
from those scattered among various tests discussed in the Army Mem- 
oirs. One or two turn out to be identical with some now appearing in 
the Terman and Merrill Revision of the Stanford — probably because 
borrowed from the same source. This duplication, however, will not 
seriously affect the usability of the items. Our experience has shown 
that the Comprehension questions are among those which suffer least 
from practice effect. It is curious how frequently subjects persist in 
their original responses, even after other replies are suggested to them. 

Precisely what functions the Comprehension Test involves is difficult 
to say. Off hand it might be termed a test of common sense, and it is 
so called on the Army AJpha. Success on the test seemingly depmds 
on the possession of a certain amount of praoticabiiformation and a 
general ability to evaluate past experience. The questions included 
in our list are of a sort that the average adult may have had occasion 
to answer for himself at some time, or, in one form or another, have 
heard discussed by friends. Some of them, like the “Land” and 
“Marriage” questions, may seem unfair to yoimg persons, and in par- 
ticular to children./ Indeed, when given to children, the ans wers to 
these questions show a considerable correlation with age and social 
maturity. Although bright young children are prevented from making 
high scores on the test, it seems, however, to have no serious effect on 
their final rating. These questions, it will be noted, involve no unusual 
words, so that even individuals of foreign background and bi-lingual 
homes have no difficulty in understanding them. We have, neverthe-' 
less, found that poor verbalizers and those unaccustomed to setting 
forth their ideas in words often make low scores on the test. 

The Comprehension Test holds up very well with age and, even when 
it begins to fail off, drops less than most of our other tests. It corre- 
lates best with Similarities, and General Information and least well 
with the Digit Span and the Object Assembly. The correlation (r) with 
total score is .66® for ages 20-34 and .68 for ages 35-49. 

• For probable errors of these and other * V’s” given in this chapter, see tables 
40-44. 
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AEITHMETICAL REASONING TEST 

The ability to solve arithmetical problems has long been recognized 
as a sign of mental alertness. '"'lEven before the introduction of psy- 
chometrics, it was used as a rough and ready measure of intelligence. 
Now most intelligence scales include items calling for arithmetical 
reasoning in some form. The inclusion of such items is fully justified: 
Arithmetical Reasoning tests correlate highly^ with global measures of 
intelligence. - 

In addition to being a good measure of general intelligence the Arith- 
metical Reasoning Test enjoys the advantage of being easily devised 
and standardized. But its merits are lessened by the fact that it is 
influenced by ^education and occupational pursuit. Clerks, engineers 
and business men usually do well on arithmetic tests, while housewives, 
day laborers and illiterates are often penalized by them. Another short- 
coming of the test is that individual scores may be affected by fluctua- 
tions of attention and transient emotional reactions. 

The general appeal and interest which the Arithmetical Test has for 
most adults should be especially mentioned. Most adults look upon 
questions in arithmetic as a task worthy of a grown-up. They may be 
embarrassed by their inability to do certain problems, but they almost 
never look upon the questions as unfair or inconsequential.^ Perhaps 
our choice of problems has something to do with pro voking this attitude. 
All the problems touch upon commonplace situations or involve practical 
calculations. Moreover, they have been so devised as to avoid ver- 
balization or reading difficulties.® The knowledge required to solve 
most of our problems is not beyond that taught in the first seven grades 
of school or what the average adult could acquire for himself in the 
course of ordinary business transactions. 

While the influence of education on the individuaFs ability to answer 
arithmetical problems lessens the value of the test as a measure of adult 
intelligence, the effect of the interrelation between the two factors is not 
entirely negative. It appears that children who do poorly in arith- 
metical reasoning often have difficulty with other subjects. A number 
of our examiners reported they were sometimes able to diagnose educa- 
tional abilities on the basis of scores obtained on this test, especially 
when supplemented by scores obtained on the General Information 

^ They do so, however, to a lesser degree than certain other tests that enjoy 
less ix)pularity, e,g., the General Conaprehension and Similarities tests. 

* All but the last two questions are given orally, and even these may be read 
to the subject. 
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Test. The combined scores of these two tests frequently furnished an 
accurate estimate of the subject’s scholastic achievement. 

^The correlations between the Arithmetical Reasoning Test and total 
scale score are neither among the highest nor the lowest obtained. They 
vary with the age at which they are calculated, being generally higher 
at the upper than lower ages/ For the age group 20 to 34, the correla- 
tion between Arithmetical Reasoning and total scale score is .63, for the 
age group 35 to 49, .67. Though the test versus total scale correlations 
rise with age, the absolute scores on the test decrease significantly with 
age. Here again, the Arithmetical Reasoning Test occupies art inter- 
mediate position. It does not fail off as rapidly as most of the perform- 
ance tests or Memory Span for Digits, but neither does it hold up as well 
as the Vocabulary, General Information or General Comprehension 
Tests. 


MEMORY SPAN FOR DIGITS 

There is perhaps no test that has been so universally used in scales 
of intelligence as that of memory span for digits. It forms part of the 
original Binet scale and all the revisions of it. It has been used for a 
long time by ps^^chiatrists as a test of retentiveness and by psychologists 
for all sorts of psychological studies. Its popularity is based primarily 
on the fact that it is easy to administer, easy to score, and specific as to 
the type of ability it measures. Nevertheless, as a test of general in- 
telligence it is among the p oorest. Memory span, whether for digits 

forwards or backwards, generally correlates very poorly with all other 
tests of intelligence. The ability involved contains little of ‘‘g”, and 
as Spearman has shown is more or less independent of the general factor. 
Our own results confirm these observations. For a long time we con- 
sidered the desirability of eliminating the test from our battery al- 
together, hjut finally decided to retain it for the following reasons: (1) 
Whila niemory span for digits backwards and forwards is on the whole 
a poor measure of intelligence, it is nevertheless asT extremely good one 
at the lower levels. Except in cases of special defects or organic disease, 
adults who cannot retain 5 digits forwards and 3 backwards wiU be 
found, in nine cases out of ten, to be feebleminded.^ (2) Special 

® Rote memory more than any other capacity seems to be one of those abilities 
of which a certain absolute minimum is required, but excesses of which seemingly 
contribute relatively little to the capacities of the individual as a whole. The 
Memory Span for Digits test has the great merit of quickly indicating whether 
an individual has that relative minimum. 
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diflBculty with the repetition of digits forwards or backwards is often 
of diagnostic significance. Obvious examples are the memory defects 
which constitute clinical symptoms in certain organic and other types 
of cases. A marked failing off in memory span is often one of the earliest 
indications of them.^® 

Low scores on the Memory Span Test are frequently associated with 
attention defects. ^ Individuals with these defects seem to have a special 
diflSculty repeating digits backwards. | This deficiency is often referred 
to by psychologists as lack of mental control. The term is rather un- 
fortunate as it implies, and is often interpreted as meaning not only 
an inability to hold things before the mind, but also as a lack of self- 
control, in the broader sense. Both are spurious generalizations. 
Nevertheless, the failure to repeat digits backwards does often correlate 
with difficulties of attention and lack of ability in doing intellectual 
work which requires concentrated effort. Knowledge of this fact is 
frequently an aid to clinical diagnosis. The question, however, still 
remains whether the digit-span test might not better be used as a supple- 
mentary test rather than be included in the general intelligence scale. 

It should be noted that as included in our battery, memory span 
for digits forward and memory span for digits backward have been 
combined into a single test. This was done for two reasons: first, 
because of the limited range of each of them when taken separately. ‘ 
On memory span for digits forward, a score range of only 4 points 
(repeating 5, 6, 7, or 8 digits) includes about 90 per cent of the adult 
population, and about the same percentage is included by the ability 
to repeat 4 to 6 digits backwards. Such a range is obviously too small 
for a point scale. By combining the scores obtainable on both into 
one test measure, we succeeded not only in extending the testes range, 
but also in closing up wide gaps that obtain between successive scores 
when the tests are used singly. Our second reason for combining digits 
forward and digits backward into a single test was for the purpose of 
reducing the amount which each contributed to the total score. - Con- 
sidering their limited correlation with general intelligence, it did not 
seem reasonable to give them as much weight as the other tests of the 

10 Wells has pointed out that the relation between the number of digits that an 
individual can repeat forward and those he repeats backward is often of diagnostic 
value in certain organic cases. Alcoholics with Korsakoff syndrome, for ex- 
ample, do much better on digits forwards than on digits backwards. Where the 
discrepancy is large, it often indicates mental deterioration. See this point; 
Wells, F. L. : Menial Teste in Clinical Practice. Ch. VIIL 
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scale. If we had used each of them as a separate test, they could have 
together contributed xt instead of of the total score which they 
now do. 

Memory span for digits is too familiar a test to need further analysis. 
We include, however, some comments made by various of our examiners 
as regards their experience with it. “The effectiveness of this test 
depends upon calmness and strict attention to the material presented. 
Care must be taken not to give the test when the individual is fa- 
tigued.” “The test is sometimes influenced by the auditory factor. 
People with defective hearing sometimes fail on it because they do not 
hear the numbers distinctly.” “It is really best for picking out mental 
defectives.” The last comment corroborates the point we have stressed 
in our Introduction, namely that many of our abilities are such that 
they enter into intellectual functioning only as necessary minimum 
Ordinarily, a person who cannot repeat 4 digits forward is so handi- 
capped as far as intake of the external world is concerned that he will 
inevitably be unable to function above the mental defective level. 
A bove a span of 5 or 6 digits, the value of further improvement in the 
capacity diminishes very rapidly. A good rote memory is of practical 
value but correlates very httle with higher levels of intelligence. 

As might be expected, memory span declines rapidly with age. It^ 
holds up better, however, than certain other abilities, for example, those 
tapped by the Picture Arrangement. It would seem, therefore, that 
older people have other faihngs of which they might much more reason- 
ably complain than their poor memory. The r for the Digit-Span Test 
with total score is .51. It correlates best with Digit-Symbol and worst 
with the Object Assembly Test. 

SIMILARITIES TEST 

This test turns out to be among the best of our entire batteiy. But 
we did not include it in our testing program until several hundred sub- 
jects had been examined without it. We omitted the Similarities Test 
from our original selection because of the mistaken belief that it would 
be unduly influenced by the language factor. This view was largely 
due to our previous experience with the more difficult similarities items 
on the Stanford-Binet. / Because we had found so many adults failing 
on the difference between “idleness” and “laziness” owing to their 
unfamiliarity with the word “idleness” and so many more on the 
difference between “evolution” and “revolution” because they had never 
heard of “evolution”, /we were inclined to doubt the possibility of 
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devising a similarities test that would be free of linguistic difficulties 
at the higher levels./ A little research soon showed, however, that it 
was possible to increase the difficulty of the items without resorting to 
unusual or unfamiliar words. For example, with the words 
and ^‘tree’b one can formulate the question, ‘^In what way are a fly 
and a tree alike?” and this question is answered correctly by less than 
20 per cent of the adult population. When words like these are chosen, 
the Similarities Test can be freed almost entirely from the linguistic 
factor. /This will be obvious to the reader from an inspection of the 
paired words which we finally used for our comparisons. 

Our final list of 12 selected from almost double the number is as 
follows: 


Orange 

Banana 

Coat 

Dress 

Dog 

Lion 

Wagon 

Bicycle 

Daily Paper 

Radio 

Air 

Water 

Eye 

Ear 

Egg 

Seed 

Wood 

Alcohol 

Poem 

Statue 

Praise 

Punishment 

Fly 

Tree 


i The Similarities Test has several merits. It is easy to give and ap- 
pears to have an interest appeal for the average adult.^' It is the kind of 
test which has been recognized by all investigators as containing a great 
amount of “g”. Over and above this/ the test has certain qualitative 
features, the most important of which is the light which the type of 
responses received throws upon the logical character of the subject’s 
thinking processes. There is an obvious difference both as to maturity 
and as to level of thinking between the individual who says that a 
banana and an orange are alike because they both have a skin, and the 
individual who says that they are both fruit. As already noted by 
Terman and others, it is not until the individual approaches adult 
mentality that he is able to discriminate between essential and super- 
ficial likenesses. But it is remarkable how large a percentage of adults 
also never get beyond the superficial type of response. It is for this 
reason that, unlike previous methods of scoring, the one employed in 
our scale distinguishes between superior and inferior responses by allow- 
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ing different credits for each.! Thus, when the subject says an orange 
and banana are alike because “you can eat them’^ and a bicycle and 
wagon, “because they have wheels”, he receives a credit of 1, whereas 
the^responses, “both are fruit”, and “means of conveyance”, are scored 
2. / This qualitative difference in response is of value not only because 
it furnishes a more discriminating scoring method, but because it is 
often suggestive both of the evenness and level of the subject’s intellec- 
tual functioning. Some subjects’ total scores, even when relatively 
good, are largely made up of 1 credits, whereas the scores of others are 
of a varied proportion of 0, 1 and 2 credits. The former are likely to 
bespeak individuals of consistent intellectual ability, but belonging to 
a type from whom no liigh grade of intellectual work may be expected; 
the latter, while erratic, have many more possibilities. 

Our correlations for the Similarities Test are not as complete as those 
of other tests, but they are uniformly among the highest obtained either 
with the individual tests or with total score. J The test’s r with the 
rest of the scale is .73. 

PICTURE ARRANGEIMENT TEST 

/ This test consists of a series of pictures which, when placed in the right 
sequence, tell a little story. The picture series are not unlike the short 
comic strips to be found in the daily papers. The pictures are presented 
to the subject in a disarranged order and he is asked to put them to- 
gether in the right order so that they make a sensible story. The correct 
order is the one originally given to the pictures by the artist. 

A test of this sort was seemingly first used by DeCroIy.^^ In 1917 
several Picture Arrangement series were tried out by the Army psychol- 
ogists, as subtests on a group examination, and found wanting; but 
another set (the Foxy Grandpa series) ultimately found its way into 
the Army Performance Scale,^^ It was, however, not used to any great 
extent. Nor have other tests of this kind, — possibly because of the 
difficulties in scoring as well as in getting up good sequences, — had great 
vogue in this country. 3ut recently Cornell and Coxe^^ again experi- 
mented with some picture series and included them in their scale. 

The picture series which we have chosen for our test consist of seven 

DeCroly, I.: Epreuve nouvelle pour Pexainination mental. Ann. Psychol., 
1914, XX, 140-159. 

Army Memoirs. Nat. Academy of Sciences, vol. XV, pp. 189 and 404. 

“ Cornell, E. L. and Coxe, W. C. : A Performance Ability Scale. New York, 
1934. 
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sets, three adapted from the Army Group Examinations and four en- 
tirely new ones selected from Soglow’s well known King series which 
appeared in the New Yorker magazine several years ago. Those 
adapted from the Army Group Tests were completely redrawn and in 
some instances slightly altered as to content? 

/ The set of pictures included in our battery represent the final choice 
from among more than twice that number originally tried out. They 
were selected on the basis of interest of content, probable appeal to 
subjects, ease of scoring and discriminating value. Any attempt to 
satisfy all these conditions was bound to occasion difficulties, and in 
spite of the considerable labor spent before making definitive choices, 
the final selection leaves much to be desired. The fault, however, is 
not so much with our particular selection as with the limitations inherent 
to all picture arrangement tests, namely their dependency upon actual 
content.i It is of some moment whether the story told by the pictures 
is that of a bird building a nest, or a policeman pursuing a thief in a 
radio car. The former is a situation a country boy may grasp at once, 
the latter may puzzle him a good deal; and what holds for such simple 
situations plays an even greater rdle when the story told by the pictures 
is more complicated. ^ Our aim was naturally to choose situations from 
the American scene, but by so doing we also succeeded in making a num- 
ber of our pictures very puzzling to subjects of foreign origin. In some 
other instances, we found that subjects of a particular cultural level 
were rather favored by one or another of the series./ For example, 
some of the Soglow pictures, which are not only among the best, but also 
among the most difficult, were surprisingly easy for the sophisticated 
type of individual. 

’ In spite of the above limitations, the Picture Arrangement test has 
9ome very worthwhile merits. In the first place, it is the type of test 
which effectively measures a subject’s ability to comprehend and size 
up a total situation, f The subject must understand the whole, must 
get the “idea” of thekory before he is able to set himself effectively to 
the task. There is, of course, some trial and error experimentation, but 
the subject is also called upon to attempt appraisal of the total situation 
more fhau in most other tests. Secondly, the subject matter of the test 
nearly always involves some human or practical situation. The imder- 
standing of these situations more nearly corresponds to what other 
writers have referred to as “social intelligence”.^^/ iWe ourselves, as 
already indicated, do not believe in such an entity. Our point of view 

“Alas, both deliniiueiits and psychopaths often do very well on this test. 
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is that social intelligence is just general intelligence applied to social 
situations. Nevertheless, we have observed that individuals who do 
fairly well on the Picture Arrangement seldom turn out to be mental 
defectives even when they do badly on other tests/ 

A word as to the method of scoring. In a test of this kind the ques- 
tion always arises whether one should allow part credit for possible but 
incorrect combinations. The answer depends upon how much the test 
gains, that is, improves its correlations with total score when such 
allowance is made. /For our short picture series the gain was practically 
nil and accordingly lOur first three picture series were given either full 
or no credit at all depending upon whether they were or were not ar- 
ranged in the exact way called for. With the longer picture series,— 
those consisting of 5 or 6 pictures, — seme gains were to be had by 
crediting certain combinations. On these, partial credit is allowed for 
various imperfect arrangements providing they make “sense^’f What 
makes sense was determined by a group of four judges after inspecting 
actual arrangements obtained from over 200 subjects. In general, the 
number of credits assigned to imperfect arrangements were roughly 
proportioned to the frequency with which the several arrangements 
occurred. Experimentation with different statistical procedures showed 
that little if any advantage was to be had from refined scaling and 
scoring . : The arrangements given in our manual cover pretty well most 
of the rational arrangements of pictures which the individual series 
permit. ^ Occasionally, however, a subject does produce a different one 
for which he is able to give a convincing explanation, but for which no 
credit is allowed in our tables. In such cases, it will be found that 
disallowing the subject’s response does not materially influence his total 
score on the test; but provision is made for the examiner to credit the 
subject with a reasonable additional score. More interesting than the 
question of credits allowed, however, in such cases, is the explanation 
which the subject may give for his unusual arrangement. Consistently 
bizarre explanations are suggestive of some peculiar mental orientation 
o^ even psychotic trend. I 

I The Picture Arrangement test holds up age and correlates 

poorly with the other tests of our scales. 1 It correlates best with Picture 
Completion and Similarities, and poore^ with Digit Span and Object 
Assembly. Its r with total score is .51. 

PICTURE COMPLETION TEST 

The name “Picture Completion” is usually associated with a test like 
the Healy Picture Completion II, in which the subject is required to 
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complete the sense of a picture by selecting a best fitting piece from 
among several possible choices. The test presented here under the 
same name is somewhat different. 'It merely requires the subject to 
discover and name the missing part* of an incompletely dra\\m picture/ 
He is shown a picture, like a steamship minus its funnel or a watch with 
its second hand missing, and asked to indicate the missing part. / In its 
present form the test is very much like that of the “Mutilated Pictures’" 
of the Binet Scale, of which it may be considered an extension. | 

( A test of this sort forms part of many Group Examinations.^® Its 
popularity is fully merited^® even though the procedures used in adapting 
it for group testing generally limit its possibilities. One of these limita- 
tions is that the subject is required to draw in the missing part; another 
that the number of items used have generally been too few and often 
far from satisfactory. Preliminary experiments with pictures pre- 
viously used on group examinations showed that, for the most part, they 
were haphazardly chosen. Many of the items were much too easy and 
some unusually difficult. On the whole, the individual pictures showed 
little discriminative value.' 

Suitable items for a Picture Completion Test are hard to find and 
present a number of difficulties. If one chooses familiar subjects, the 
test becomes much too easy; if one turns to unfamiliar ones, the test 
ceases to be a good test of intelligence because one unavoidably calls 
upon specialized knowledge. The 15 pictures included in our test are 
the final selection from some 30 to 35 tried out over a period of six 
months with various groups of subjects of known intelligence levels. 
Each picture was tried out separately and admitted or rejected on the 
basis of its discriminating^^ value. While there are a few with which 
we are still not satisfied, our final set of pictures seems to be a satisfactory 
series, / Altogether they have furnished a test which, next |^o the Block 
Design, turns out to be the best of our Perfonnance Scale. ; The test is 
particularly good in testing intelligence at the lower levels. Ostensibly 
it measures the individual’s basic perceptual and conceptual abilities in 
so far as these are involved in the visual recognition and identification 
of familiar objects and forms. To be able to see what is missing from 

Among the more familiar are the Army Beta, the Pintner Non-Language, the 
Haggerty Delta, the Detroit Kindergarten and some of the early forms of the 
Thorndike College Entrance Tests. 

On the Army Beta the test correlates .74 with total score and .72 with the 
Stanford-Binet Mental Age, 

The method by which this was done was similar to that employed in selecting 
the General Information items. See text and note on p. 79. 
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any particular picture, the subject must first know what that picture 
represents. But, in addition, he must be able to appreciate that the 
missing part is in some way essential either to the form or function of 
the object or picture. In a broad way, the test measures the abihty of 
the individual to differentiate essential from unessential details. But we 
must note again that the ability of an individual to do this depends in a 
large measure upon his relative famiharity with the object with which he 
is presented, that is to say, upon the actual content of the picture. ^ A 
person who has never seen or read about a steamship cannot be expected 
to know that all such boats have funnels and that these are generally 
to be found at the center of the ship. Unfamiliar, specialized and 
esoteric subject matter must therefore be sedulously avoided when 
choosing pictures for this test. However, this cannot be done al- 
together. If nothing else, there is always the factor of sex differences 
to be considered. For example, in examining the incidence of correct 
responses to our own series, we found that more men than women failed 
to detect the missing eyebrow in the picture of a girPs profile (item no. 
14) and more women the missing thread in the drawing of the electric 
bulb (item no. 13). 

\ The Picture Completion as given on the Bellevue Scale is not a timed 
test. Each of the itenos is exposed for 15 seconds, and this is more than 
sufficient for most subjects. Most responses are made in less than 10 
seconds. When a subject fails to detect the missing part within 15 
seconds, experience has shown that he is unlikely to do so thereafter. 
Increasing the time limit up to 30 seconds per picture had little effect 
on total scores, f 

I The Picture Completion holds up with age better than most other 
performance tests. Its main limitation is its ‘dow ceiling”, that is to 
say, its relative inadequacy in discriminating between higher levels of 
intelligence (high-average and superior), 'j There is a tendency for scores 
to pile up at the upper end of the distribution table. |On the other hand, 
its discriminating power at the intermediate levels more than makes up 
for this limitation. As already noted, it is particularly effectiye in 
picking out mental defectives. f Correlation with total score (ages 20 
to 24) is .61 / As regards individual tests, it correlates highest with the 
Block Design and lowest with the Memory Span for Digits. | 

block: design 

^ The Block Design test was originated by Kohs who offered it as a 
comprehensive measure of non-verbal intelligence. Wlnle we do not 
believe that any single test can be so used, our own results indicate that 
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its author’s enthusiasm for it is fully justified. The test turns out to be 
our best single performance item, f It conforms to all statistical criteria 
for a “good” test, fit not only correlates well with total score and 
individual test items, but is one of the few performance tests that seem- 
ingly does measure very much the same sort of thing that verbal tests 
measure. In fact, it correlates better with Comprehension, Informa- 
tion and Vocabulary than some of the verbal tests themselves./ It is 
hard to explain the basis for its merits other than that the reproduction 
of the type of design devised by Kohs in some way involves both syn- 
thetic and analytical ability. Cuiiously enough, individuals who do 
best on the test are not those who see, or at least follow, the pattern as 
a whole, but such as are able to break it up into small portions. / In this 
connection, the recent study of NadeP^ on intellectual disturbances 
following certain (frontal lobe) brain lesions is of interest. As between 
“following the figure” and breaking up the design into its component 
parts, the patients with frontal lobe lesions in contrast to the normal 
control group used almost exclusively the former method. 

{ The Block Design as adapted for our scale is basically similar to that 
employed by Kohs in his original standardization, but its content has 
been modified to a considerable degree. The most important of the 
changes introduced pertain to the reduction in the number of test cards 
used and the alteration in the figure patterns which the subject is asked 
to reproduce. The reduced number of designs was for the obvious 
purpose of cutting down the time allowed for any one test on our scale. 
Our Block Design Test consists of 7 instead of 17 figures, with a conse- 
quent reduction in the time required for completing the test from some 
35 to somewhat less than 10 minutes. The change in pattern was 
ejffected, both to avoid reproduction of items used on other scales^^ and 
to eliminate the possible factor of color confusion. The original Kohs 
included figures made up^ of red, yellow, blue and white; ours makes 
use only of red and white.| Our mode of presentation also differs some- 
what, as may be seen by consulting the instructions given in our manual. 

(The Block Design is not only an excellent test of general intelligence, 
but one that lends itself admirably to qualitative analysis. One can 
learn much about the subject by watching “how” he takes to the task 
set himl We have already mentioned the matter of the differences in 
method that may be employed in assembling the designs, — to wit, that of 

^*Nadel, A. B. : A Quantitative Analysis of Behavior Following Cerebral Le- 
sions. Arch, of Psychol, f no. 224, New York, 1938. 

The Kohs cards form, part of the Grace Arthur Scale. 
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following the figure as against that of breaking it up into its component 
parts. I There is also a difference of attitude and emotional reaction on 
the part of the subject which gives us valuable information about him. 
One can often distinguish the hasty and impulsive individual from the 
deliberate and careful type, a subject who gives up easily or gets dis- 
gusted, from the one who persists and keeps on working even after his 
time is up, and so on of a number of other temperamental traits which 
manifest themselves not infrequently in the course of a subject’s per- 
formance. ] 

The test’s value for diagnostic purposes is particularly worth mention- 
ing. Patients with mental deterioration and seniles have particular 
difficulty in managing the test and often cannot complete the simplest 
design, however much they try. So do most cases of brain disease. 
The difficulty here seems to be a certain lack of synthesizing ability, 
or loss of the “abstract approach”, in K. Goldstein’s sense of the term/ 
Nadel points out that in many cases of frontal lobe lesions, the patient’s 
inabihty to reproduce the design could be explained on the basis of a 
loss of ability to “shift”. Some of the patients seemingly did not know 
when the designs were finished, others had particular difficulty in 
attending to the figure’s color and pattern simultaneously, and still 
others, or the same patients, would get stuck at certain portions of the 
figures, apparently unable to synthesize the rest of the pattern with it. 
Whatever the mechanisms involved, the Block Design is one of the 
best tests we have for showing up early as well as late disturbances in 
the higher perceptual processes. 

Among examiners’ comments on the test we find the following: “The 
test involves the ability to perceive forms and to analyze these forms.” 
“It involves the ability to perceive pattern.” “In the Block Design, 
speed and success (of reproduction) is largely dependent upon the 
individuaTs abihty to analyze the whole into its component parts.” 
“Older adults do not do so well on it.” “It is very good for picking 
out low grade people.” “Artists and artisans do much better on the 
test than others.” “The Object Assembly and Block Design Tests 
seem to get at some sort of creative ability.” “Some subjects are 
penalized by the time score and by the fact that they haven’t played 
with blocks for a long time.’ ” “This test and the Object Assembly are 
perceptibly mfluenced by a person’s occupation”. 

I With the possible exception of the Similarities and the Vocabulary, 
the Block Design correlates more highly with total score than any of our 
other tests including the verbal ones. / Its “r” with total score (for 
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ages 35-49) is .73. Its highest single correlations are .60 with Informa- 
tion (ages 35-49) and .57 with Picture Completion (ages 20-34). / its 
lowest correlations are with Object Assembly .32 (ages 20-34) and Pic- 
ture Arrangement .37 (ages 35-49). These correlations are more like 
those obtained with the verbal than with the performance items of the 
scale. In spite of its high correlations, the test does not hold up well 
with age.4 Persons over 40 do progressively w^orse at it as they grow 
older. Considering the fact that the test correlates highly with all 
measures of general intelligence, we interpret this decline as one of the 
best proofs of the natural falling off of intellectual ability with age. For 
the same reason it may be regarded as an excellent measure of deteriora- 
tion, a conclusion which is amply confirmed by the study of individual 
cases. I 

DIGIT SYMBOL TEST 

/The Digit S3Tnbol or substitution test is one of the oldest and best 
established of all psychological tests. It is to be found in a large variety 
of intelligence scales, and its wide popularity is fuUy merited. The 
subject is required to associate certain symbols with certain other sym- 
bols, and the speed and accuracy with which he does it serve as a meas- 
ure of his intellectual ability. In deciding upon its inclusion in our 
battery, the only question in our minds was the degree to wliich the 
purely motor aspect of it might penalize the performance of adults. 
This motor factor turned out to be less important than originally sus- 
pected from certain points of view, and rather more important as regards 
certain others.'’^ In the case of illiterates who have not had practice in 
the use of pencil and paper, it is particularly disturbing. One does not 
ordinarily realize what a difficult task the formation of numerals and 
letters may be for a person who has never learned how to wiite. Such 
individuals approach the task much like a schooled person might when 
attempting to write the letters of a foreign language for the fimt time. 
The letters are not written but copied. Obviously, then, ^fiie test 
cannot be used on illiterates, and in the examination of such individuals, 
it is omitted from our scale. In the case of literate older subjects, the 
problem is of a different order. Here the difficulty is not lack of famil- 
iarity with paper and pencil, but that of motor set and motor speed. 
To the extent to which these correlate -with an actual loss in learning 
ability, the factors cannot, of course, be said to lessen the value of the 
test. But the question is whether motor speed does not have an inde- 
pendent and disturbing influence. ||The actual facts are these; the mean 
scores on the Digit Symbol Test begin to drop after 25, and after 35, 
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and especially 40, they taper off very markedly with age. All this 
would be very discouraging except for the fact that the falling off of 
ability turns out to be more or less proportional to the original capacity 
of the individual as measured by the total score on the scale. This is 
because there is a high correlation between test scores on the Digit 
Symbol and total scores on the scale as a whole; and to the fact that in 
the calculation of our intelligence ratings (see page 219) allowance is 
made for the decline of general ability with age/ 

f We should note in passing that neuro tic and unstable individuals also 
tend to do rather badly on the Digit Symbol (as indeed on all other sub- 
stitution tests). The inferiority of neurotic subjects on tests of this 
kind was noted as long ago as 1923 by Tendler.^° Tendler suggests that 
this is due to some sort of associative inflexibility in the subject, and a 
tendency toward mental confusion. | More obviously neurotic subjects 
do badly on this test because they have difficulty in concentrating and 
applying themselves for any length of time and because of their emo- 
tional reactivity to any task requiring persistent effort. The poor 
performance of the neurotic represents a lessened mental efficiency 
rather than an impairment of intellectual ability. 

iThe Digit Symbol Test incorporated in our scale was taken from the 
Army Beta.^^ This particular form of Substitution Test (originally 
devised by Otis) has several advantages over many of the others com- 
monly used. One is that it comprises a sample demonstration which 
permits the examiner to make sure that the subject understands the 
task. Another is that the subject is required to reproduce the unfamihar 
symbols and not the associated numerals. The latter fact lessens the 
advantage which individuals having facihty with numbers would other- 
wise have. The only change made from the way the test is administered 
is in the matter of time allowance. The two minutes allowed on the 
Army Beta was found to be too long. There was a tendency for scores 
to pile up at the upper end. Reducing the time not only elimmated 
this shortcoming, but also improved the distribution of test scores when 
these were converted into standard deviation equivalents. Several 
different time allowances were tried out, and a period of minutes was 
found to give best results./ 

^he Digit Symbol Test correlates best with the Memory Span for 


Tendler, A. D.: The Mental Status of Psychoneurotics. Arch, Psychol. 
no. 60, 1923. 

It likewise forms part of the original Army Performance Tests; the test has 
also been included in the Cornell-Coxe Performance Scale. 
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digits and poorest with Object Assembly./ Its correlation (“r”) with 
total score is .673 for ages 20-34 and .697 for ages 35-49. As already 
mentioned, Digit Symbol performance declines rapidly with age. 

OBJECT ASSEMBLY 

I The test which is included under this title, consists of three separate 
figure formboards, a Manikin, a Feature Profile, and a Hand. The 
Manikin is essentially the same as that devised by Pintner and fiirst 
used on the Pintner-Paterson scale,^^ except that the suggested features 




Hand and TEATimE Profile 


have been redrawn to make them more human in appearance. Our 
Profile resembles that used by Pintner-Paterson but differs from the 
original in several respects. It is a profile of a woman’s head instead of 
a man’s, the ear is divided into two instead of four parts, and a piece has 
been cut out at the base of the skull. The Hand is entirely new and was 
devised by the author. As presented to the subject, it consists of a 
mutilated hand from which the fingers and a large section of the palm 
have been cut away. \ The assembled Hand and our modified Profile 
are shown in figures 7 and 8. Details as to method of presentation and 
scoring will be found in the manual portion of the book. 

“ Pintner, R. and Paterson, R, G.: Scale of Performance Tests. N. Y., 1921. 
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The Object Assembly was included in our battery only after much 
hesitation. We wanted at least one test which required putting things 
together into a familiar configuration. Our experience over a long 
period with the commonly used formboards, had convinced us that 
whatever their merit when administered to children, they were generally 
ill-adapted for testing adults. IXIost of the standardized formboards 
are much too easy for the average adtilt, and at the high levels have 
very little discriminative value. / The distribution tables^^ for these 
formboards moreover have unusually large scatter. Taken singly, 
the predictive value for most of them is nearly nil. I The Manikin 
and Feature Profile seemed better in this respect than most of the form- 
boards, but not much. Like all formboards, they also show great 
I^ractice effects, and, therefore, are almost useless for retest purposes., 
} In spite of the foregoing hmitations, the Object Assembly Test has a 
number of compensating features, and it is primarily because of these 
that we kept it in our battery. The first point to be noted is that while 
the test correlates poorly with almost every one of our subtests, it does 
contribute something to the total score. Secondly, examination of the 
Object Assembly scatter diagrams show that the low correlations it has 
with the other tests, are due primarily to the large deviations of a rela- 
tively small and seemingly special group of individuals. This means, 
perhaps, that the Object Assembly is a poor test only for certain types 
of individuals.! If the test is appraised on the basis of criteria which are 
not influenced in a marked degree by the atypical individual, its rating 
is considerably enhanced. For example, if we consider mean scores 
alone, we find that the Object Assembly shows a rather good rise with 
age up to about 13 years, and a maintenance of the average adult norm 
considerably^ better than some of the other tests.^^ 

I ¥he best features of the Object Assembly, however, are its qualitative 
merits. Various examiners have praised the test repeatedly, because 
“it tells you something about the thinking and working habits of the 
subjects.” The subjects’ approach to the task may in fact be one of 
several kinds. The first, is that of an immediate reaction to the whole, 
accompanied by a critical xmderstanding of the relation of the individual 
parts. This is particularly true of responses to the Manikin test, where 
one can distinguish between the individual who recognizes from the 

For distribution tables, see Pintner-Paterson, op. cit., pp. 97-137. 

** The test is weakest in the intermediate age range, i.e., from about age 15 to 
30, where it is extremely erratic for individual cases. Part of this is due to its 
marked susceptibility to momentary changes of attention and interest. 
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start that he has a human figure to put together, from another, usually 
a mental defective, who has no idea what he is assembling but merely 
fits the pieces together by the trial and error method. A second type 
of response is that of rapid recognition of the whole but with imperfect 
understanding of the relations between the parts. This is best evi- 
denced by the manner in which many subjects handle the Feature 
Profile. Still a third type of response is one which may start with com- 
plete failure to take in the total situation, but which after a certain 
amoimt of trial and error manifestation leads to a sudden though often 
belated appreciation of the figure. Such performances are most fre- 
quently met with in the case of the Hand. Altogether, then, we may 
say that the Object Assembly test has a particular clinical value because 
it tells us something about one’s mode of perception, the degree to 
which one relies on trial and error methods, and the manner in which 
one reacts to mistakes. i 

Among the comments made on the test are the following: ‘‘The 
Object Assembly, like the Block Design Test, seems to get at some sort 
of creative ability, especially if the performance is done rapidly.” 
“Successful reproduction of the Object Assembly items depends upon 
the subjects’ familiarity with figures and their ability to deal with the 
part-whole relationship.” “People with artistic and mechanical ability, 
seem to do very well on this test.” “It sometimes reveals the ability 
to work for an unknown goal.” “Some subjects continue working at 
putting together the Hand although they seem to have not the slightest 
notion as to what it is they are putting together.” To this extent the 
tests are of value in revealing the capacity to persist at a task. Some 
subjects tend to give up very quickly and are discouraged by the slight- 
est evidence of lack of success. 

As already mentioned, the Object Assembly shows the smallest corre- 
lation with all other tests when taken individually or collectively. It 
correlates least well with the Memory Span for Digits and best with 
the Block Design. Interestingly enough, the correlation with total 
score in the case of adults, rises significantly with increasing age.f Its 
“r” is .41 for ages 20-34 and .51 for ages 45-49. With respect to the 
particular abilities involved in this test, it would seem that older people 
react more nearly like children than do average adults. 

VOCABIJLART TEST 

Contrary to lay opinion, the size of a man’s vocabulary is not only 
an index of his schooling, but also an excellent measure of his general 
intelligence. Its excellence as a test of intelligence is seemingly derived 
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from the fact that the number of words a man knows is at once a measure 
of his learning ability, his fund of verbal information and of the general 
range of his ideas. The one serious stricture that can be made against 
the Vocabulary Test as a measure of a man’s intelligence is that the 
number of words a man acquires must necessarily be influenced by his 
educational and cultural opportunities. It is seemingly unfair to 
illiterates and persons with a foreign language handicap. Because of 
these possible limitations, the Vocabulary Test was included only as an 
alternate in the original battery. Our apprehension regarding the 
Vocabulary has proved unfounded, and it is now recommended as a 
‘‘regular” test that should be included systematically when the Full 
Scale is administered. ■ 

i The words included in our Vocabulary list were taken from one of 
the Funk and Wagnall’s Standard (School) Dictionaries. The list was 
arrived at by choosing 100 words at random in the following manner: 
Beginning with an odd page, we selected every top word but one^® in 
the left hand column of every fifth page and continued the process until 
we had listed 100 words. These 100 words were arranged in rough 
order of difficulty, divided into two lists of 50, each and given to experi- 
mental groups of known intelligence level. By this method the words 
which seemed to discriminate poorly between different intelligence 
levels, were quickly eliminated. After some further experimentation 
the two lists were consolidated into one of 60, then of 50 words, and 
finally cut down to the present list of 42. Their order of difficulty, at 
least for the individuals in the New York City area, is fahly consistent, 
except that there are always certain words which are especially easy^® 
or hard for some individuals.! 

, Apart from its value as a measure of intelligence, the Vocabulary is 
an especially desirable test to have on any scale, because of its qualita- 
tive possibilities. In defining a word, a subject gives us more than its 
mere meaning. In many instances he tells us a good deal about himself, 
or at least about the quality and character of his thought processes. 
These facts may not be immediately apparent but can be frequently 
brought to light by analyzing the formal and contextual aspects of the 
subject’s response- | There is an obvious difference in the reasoning 
ability^^ between two adults, one of whom defines a “donkey” as “an 

Omitting however, obsolete, technical or esoteric words. 

** Since the war the words nitroglycerine y espionage and Jiara-kari have become 
much easier, and their altered difficulty must be allowed for. 

We entertained for a long time the possibility of using preciseness and 
accuracy of definition as a basis for scoring, but actual attempts to do so proved 
impractical. 
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animar’ and the other who defines it in such terms as “it has four legs^ 
or that “it looks like a jackass.’^ Sometimes the quality a subject s 
definition tells us something about his cultural milieu. |The type of 
word on which a subject passes or fails is alwa3^s of some significance, i 
Dull subjects from educated homes often get uncommon words like 
“vesper’’ and “pewter” but fafi on “gamble” and “brim”; the pedant 
will get “espionage” but fail on “spangle”, etc. But perhaps more 
important than all these from a clinical point of view is^ the character 
of a definition as a whole in so far as it may give us some insight into an 
indhfidual’s thought processes. This is particularly true in the case 
of schizophrenics, the formal aspects of whose language disturbance is 
frequently diagnostic. We do not refer to such obvious aberrations^ as 
“neologisms”, “word-salads”, etc., but the more subtle ones pertaining 
to bizaiTcness of ideational content and pecuharities of expression. 
Here are some examples of words defined by schizophrenics, which 
illustrate the kind of definition one meets with in such patients . PluTcd 
“a way of thinking in grammar”. Bacon — “part of pig associated with 
breakfast”. Armory — “an institute of future training to no purpose, 
where guardsmen gather”. Guillotine — “to disconnect a person’s body 
by means of a knife”. Nail—'^s. metal object used for putting things 
together”. The last two though not strikingly aberrant, are of par- 
ticular interest, because illustrative of the pedantism and rigidity so 
frequently met with in schizophrenia. Other vocabulary character- 
istics of schizophrenia are marked perseveration, redundancy of defim- 
tiop. and incoherence of content. 

|ln estimating the size of a person’s vocabulary, items of the kind just 
discussed cannot enter into the evaluation. What counts is the number 
of words that he knows. Any recognized meaning is acceptable, and 
there is no penalty for inelegance of language. So long as the subject 
shows that he knows what a word means, he is credited with a passing 
definition. Most of the words on our list are easily scored either right 
or wrong, but to help beginners we have appended a list of criteria (in 
the manual part of the book) which will enable the examiner to score 
the doubtful responses more easily. The general rule when in doubt is 
to match the subject’s responses against the acceptable definitions and 
to score accordingly. Wherever possible it is desirable to record the 
subjects’ responses verbatim. This will not only provide for a subse- 

On this subject see for example. White, W. A.: “The Language of Schizo- 
phrenia,’^ in vol. V. of the Research Publications of the Association for Research 
in Nervous and Menial Disease, Schizophrenia. New York, 1928. 
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quent re-checking, but also furnish the material for the qualitative 
analyses discussed above. 

The Vocabulary Test correlates highly ^vith our total scale {eta == 
.85). In the original standardization we did not have sufficient data 
to obtain its correlation with the separate tests of the scale, but subse- 
quent studies by other investigators^® show that they are an high order. 
In conformance with the experience of other investigations our owm re- 
sults show that the Vocabulary Test holds up very well mth age. | Our 
results, however, do not confirm the view that it remains constant right 
up to senescence. Although the Vocabulary Test does not fall off with 
age anywhere near as much as most other measures of intelligence, it 
does faU off somewhat. This fact must be borne in mind when using the 
test for measuring mental deterioration. 

2® Goldfarb, W.: Teacher^ s College Contributions to Education^ No, 881, 'pp 
28-29. (See reference 9.) 



CHAPTER 8 


POPULATION USED IN STANDARDIZING THE 

TESTS 

In the present chapter we propose to describe briefly the chief charac- 
teristics of the population on which our scales have been standardized, 
and indicate the basis emplo 3 ^ed in arriving at our samples. It is clear, 
of course, that whatever the intrinsic merits of an intelligence scale, 
its diagnostic value must, to a large extent, depend upon the degree to 
which the characteristics of the originally tested groups will approximate 
those of the general population to whom the tests will be subsequently 
administered. 

Statisticians are wont to sum up this condition by saying that the 
standardizing sample must be truly representative of the entire popula- 
tion. This is a very nice statement of the case, but in practice not a 
veiy helpful one. What we want to know precisely is what factors 
must be taken into consideration to assure us of such representative 
sampling- The formulae which statisticians generally fiumish us merely 
afford us means of estimating the probability that a sample of such and 
such size will give us mean scores that will only deviate from that of the 
total population by such and such amounts. They tell us little about 
the qualitative differences which we are likely to find between our 
original sampling and the larger population to which the test may 
ultimately be given. For this we need to know a good deal about the 
qualitative characteristics of our group, and the variety of factors 
winch inevitably act to differentiate various sub-groups in any large 
population. Unfortunately, this knowledge is not easily obtained, 
and even when at hand, extremely difiScult to evaluate. Our own 
attempts to deal with the various problems presented leave much to 
be desired. We shall therefore limit ourselves to only a summary 
analysis of such of these influencing factors as we tried to deal with. 

The first of these factors is that of age. In the case of children its 
bearing on test scores is so obvious that separate norms for different 
ages have been the rule almost from the first day that intelligence scales 
were adopted. But in the case of adults this has not generally been 
the case. The practice here (as for instance in the matter of calculating 
I.Q.^s) has been to treat all individuals from 15, 16 or 18, as an homo- 
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geneous group so far as the age factor is concerned. This assumption 
is unwarranted, and, as we have seen, has led to serious difficulties. 
We cannot use the norms for a boy of 16^ in evaluating the performance 
of a man of 60 any more than we can use the norms of a child of 6 in 
evaluating that of a boy of 16. The age factor has to be taken into 
account, and we have done so by establishing different age norms for 
all ages up to 60. The age distribution of subjects used in the standard- 
izing sample is given in table 7. It should be noted that the number 


TABLE 7 

Age distribution of subjects used in standardizing samples 


CHILDBSN 

j ADui;rs 

Age group 

Number of cases 

Age group 

Number of cases 

7 

50 

17-19 

100 

8 

50 

20-24 

160 

9 

50 

25-29 

195 

10 

60 

30-34 

140 

11 

60 

35-39 

135 

12 

60 

40-44 

91 

13 

70 

45-49 

70 

14 

70 

50-54 

55 

15 

100 

55-59 

50 

16 

100 

60-70 

85 

Totals* 

670 


1,081 


* These totals represent only the number of cases used to obtain the norms. 
The actual number of cases tested and from which they were ^‘sampled” was much 
greater. The actual totals (up to June 1938) were: ages 6-16, 1639 cases; ages 
17-6^, 1860 cases; total, 3499 cases. 

of cases of each age level does not represent the total number of indi- 
viduals tested, but only those we included in the standardizing sample. 

The second important factor in the standardization of any intelligence 
test, especially one for adults, is that of education. Practically all 
studies show that educational attainment (as measured by the number 
of years of school attendance) and intelligence ratings (as measured 
by test scores) correlate to a relatively high degree. The correlation 
ranges in most cases from about .60 to .80. For our own adult popula- 
tion we have found it to be .64 (and with mental defectives omitted, 
.53). A correlation of this order might suggest that the ability to do 
well on intelligence tests is largely dependent upon formal education, 
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and has so been interpreted by a number of authors.^ The situation, 
however, is not so simple. 

Although the correlation between educational attainment and intelli- 
gence level is high, it does not necessarily follow that the latter is de- 
pendent on the former. It might also mean that the education which 
an individual is capable of attaining, is dependent upon his intelligence. 
This is obviously the situation in the case of mental defectives. For 
example, a defective of 15 (Binet I.Q. of circa 65) is ordinarily not 
capable of going beyond the fifth grade in school. On the other hand, 
children of superior intellectual endowment are usually ahead of the 
age-grade placement, and, other things being equal, will by the time 
they are 15 attain greater scholastic achievement than the children 
of average intelligence. We do not cite these instances to prove that 
the amount of schooling an individual achieves depends primarily on his 
native endowment, but merely to show that the interpretation of the 
correlation between school achievement and intelligence scores is not so 
simple as some investigators have considered it. All intelligence tests 
are influenced by the amount of schooling that an individual has had, 
but this schooling must be understood in the larger sense. It includes 
not only education received in the class room, but also such as is self- 
acquired. 

How to make allowance for the educational factor is not easy to say. 
Ideally it could perhaps be done by standardizing one^s tests and 
obtammg one’s norms on groups differentiated according to amount of 
schooling had. To do so, even for a limited number of categories, 
however, would require an enormous number of individuals. For 
example, if only five educational levels were taken into account we 
would have to increase our standardizing population fivefold, if eight 
categories, eightfold, and so on. That is hardly feasible at present. 
Some method of circumventing the problem is necessary. Such method 
would, of course, not take into account the individual case, but only 
the group as a whole. For example, it might be possible to select our 
standardizing group in such manner that the educational attainment 
of its average individual would be approximately equal to that of the 
average individual of the larger group for which the tests are destined. 
We have managed this for our age group 10 to 16, and the precise method 
by which it was accomplished will be presently described. 

The task of compensating for educational difference in the case of 

^ Cf. Purt, C.: Menial and Educational Tests. London, 1924. 
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adults is a much more difficult problem. It is more difficult, not only 
because the actual educational distribution of the adult American 
population has not as yet been worked out, but also because the educa- 
tional status must be set against other competing differentiae; for 
example, that of occupational status. The latter indeed offered a 
much more satisfactory basis of selection, and was the one ultimately 
chosen for our own adult sampling. Nevertheless, the question as to 
how our own sample compared as regards educational level with that 
of the population of the country as a whole, seemed one of considerable 
importance. We determined this by obtaining from the Educational 
Office of the Department of Interior^ estimates as regards the percentage 

TABLE 8 


Percentages of U. S, population and Bellevue adult sampling in respective 
educational categories 


EDUCATIONAL CATEGORY 

ESTIMATED LEVEL OP i 
EDUCATION OP U. S. 
ADULT POPULATION — j 
AGES 21 TEARS AND 
OVER (1934) 

ACTUAL EDUCATIONAL 
LEVEL OF BELLEVUE 
SAMPLING— AGES 

20 YEARS AND OVER 

(1935-1938) 

College graduates 

percent 

2.93 

per cent 

5.10 

Some college work 

4.08 

3.77 

High school graduate (only)^ 

6.85 

10.81 

Some high school work 

18.99 

18.76 

Elementary school graduate (only)* 

18.68 

28.85 

Some elementary school 

43.58 

30.17 

Illiterates. 

4.69 

2.55 ^ 


Total 

99.80 

100.01 



* The discrepancies in these categories are seemingly due to a difference in 
definition of what constitutes a public school and high school graduate (see 
text). 


of individuals reaching various grades at school, and compared the dis- 
tribution of this schooling with that of our own sample of population. 

The estimated schooling of the American population as furnished by 
the Education Office, and the corresponding figures for the educational 
categories as found in our own sample, are given in table 8. The only 
major difficulty we had in making the comparison was in deciding 
whether 8th grade grammar school meant merely reaching the 8th grade 

* We are indebted for this information to Dr. David Segal, Educational Con- 
sultant and Specialist in Tests and Measurements at that office.- 
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or completing it, and similarly in the case of the 4th year high school. 
In our own sample, we set 8 years public school and 4 years high school 
as meaning either mere attainment or actual completion of these grades. 
There was also the question as to what the term illiteracy should em- 
brace. According to our definition, it means no schooling whatsoever, 
whereas in the case of the Federal estimates it seemingly means not 
knowing how to read and write. These possible sources of error, not- 
withstanding, there is a rather remarkable correspondence between the 
educational level of the country as a whole and that of our sampling. 
It thus appears, that by following a sampling based primarily on the 
occupational distribution of our subjects, the factor of educational status 
took care of itself. The educational distribution of our sample is not 
very far from that of the entire United States. In fact, the median 
educational level of our subjects is practically identical with that esti- 
mated by the Education Office for the country as a whole. 

A third factor which might be thought of as a possibly important item 
in the standardization of an intelligence test, is that of sex differences. 
With respect to this factor most of the available data relates to differ- 
ences observed on test performances of bo 3 ^s and girls. Briefly summa- 
rized, the data show occasional significant, though generally small 
differences between the sexes as regards the efficiency on individual 
tests. For example, boys tend to do better on Arithmetical Reasoning, 
and girls better on Vocabulary tests.® But whenThe* total score is 
taken into consideration, that is to say, when the individual tests are 
combined into batteries, these differences tend to cancel each other. 
It is not clear, however, whether this nullification of sex differences is 
due to a real averaging of these differences or to an artifact resulting 
from a special selection of tests. For example, in the New Stanford 
Revision, Terman and Merrill'* eliminated such tests as they said were 
^‘unfair’’ to one sex or another. And we have done the same. Thus we 
dropped the Cube Analysis from our battery when we discovered that 
the mean scores for men and women showed systematically large differ- 
ences in favor of the former. As our scales now stand, there are no 
statistically significant differences in total score between the sexes, al- 
though women tend to have higher mean total scores at almost every 

® But that is not necessarily true at all ages. See on this point Terman, L. M. : 
Intelligence of School Children. Boston, 1919. For sex differences among adults 
see Jones, H. E.: of Conrad, H. S.: Genet. Psychol. Monog.^ 1933, no. 6, pp. 260- 
261. 

* Measuring Intelligence. Boston, 1937, p. 22. 
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year level. These small differences in themselves do not prove any- 
thing, but taken together with other evidence, have led this writer to 
the belief that it may be possible to demonstrate a measurable superi- 
ority of wQmen^oyer men so far as general intelligence is coiidemed. 
We do not have the necessary data as yet to back up this generalization, 
but we have more than a ‘‘sneaking suspicion” that the female of the 
species is not only more deadly, but also more intelligent than the male. 

We now come to a group of factors which undoubtedly influences 
intelligence test results, but whose exact role is difflcult to evaluate 
owing to our, as yet, limited knowledge of what the actual relationships 
between them may be. We refer to the factors of “race”, social milieu 
and economic status. Here again, our view is that in an ideal stand- 
ardization, there ought to be separate norms in each of these categories 
to make allowance for their respective influences. We do not think, 
however, that it is possible to do this at present, particularly when 
those to whom we might look for the facts are at so great odds among 
themselves as to what these facts are. In the case of our own stand- 
ardization we have rather cut the Gordian knot than undone it. Thus, 
we have eliminated the “colored” vs, “white” factor by admitting 
at the outset that our norms cannot be used for the colored population 
of the United States. Though we have tested a large number of colored 
persons, our standardization is based upon white subjects only. We 
omitted the colored population from our first standardization because 
we did not feel that norms derived by mixing the populations could be 
interpreted without special provisos and reservations. 

In eliminating the “colored” vs, “white” factor from our population 
problem, we do not, of course, avoid the question of race differences alto- 
gether. All that can be said about intelligence differences between the 
white and colored population of the United States, applies in a general 
way to differences in intelligence level between other “racial” groups. 
That such differences e^dst between various groups which constitute the 
white population of the United States there can be little doubt. To 
what extent, however, these differences need to be considered in an in- 
telligence examination that could be used indiscriminately on all groups, 
cannot be easil3^ answered. At all events, any attempt to make allow- 
ances for racial differences presents many practical difficulties, perhaps 
the greatest of which is the large number of individuals that one would 
have to examine in order to obtain any reliable norms. We do not say 
that this would not be a worthwhile endeavor, but it was a task which 
we were not equipped to cany out. We believe, however, that the 
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differences that might be found between the English speaking white 
groups in the United States of different ^‘racial” origin would not be 
very large. It seemed to us that if we omitted the non-English speaking 
population we could treat the remaining portion of the white popula- 
tion as a relatively homogeneous group, — at least in so far as their 
suitability as cases for our sampling was concerned. This is undoubt- 
edly making a virtue out of necessity. The only justification we offer 
for this procedure is that we believe that, taken as a whole, our subjects 
represent a fair cross section of what may be called American in- 
telligence. 

Our subjects were chosen mostly from the City and State of New 
York, but these were matched against the total population of the United 
States in a manner presently to be described. Our total sampling is on 
the whole perhaps more representative of the State of New York than 
of the entire nation. But, according to results obtained with the Army 
Alpha Test, the mean intelligence level of the white population of the 
State of New York is not far from the average for the nation as a whole. 
For this reason we may assume that our mean scores will not differ 
much from those that may be obtained from the remaining population. 
Subsequent studies, however, will have to determine the legitimacy of 
this assumption. 

We shall now describe the manner in which our actual sampling was 
achieved. For this it is necessary to distinguish between our adult and 
children’s samplings. As regards our adult groups, the procedure was 
as follows: We tried to obtain individuals from all walks of life, and 
examined as many of these as were willmg to take the tests, providing 
they were not mentally or physically ill and could understand and write 
English. In this way over 1800 adults, both male and female, ranging 
m age from 17 to 80 were examined. Transcripts of their examination 
were recorded on cards and put into a general file. This general file 
served as an ^'urn” from which cases could be picked at random to fill 
whatever criterion of selection might be decided upon. 

After much consideration we felt that the most unbiased method of 
selection would be a sampling based upon the occupational distribution 
of the country’s adult population. For the necessary data we turned 
to the United States Census for 1930 as the best source. The numerous 
occupational sub-divisions given in the census report were first com- 
bined into ten broad categories such as Agricultural, Manufacturing, 
Mechanical, Clerical, etc. From the figures given in the census tables, 
we calculated for each sex and for each age, the percentage of the total 
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nximber of individuals engaged in the listed occupations. This furnished 
us the figures given in table 9. 

After this table had been devised, we merely went to our files and 
took from them such cases as would enable us to match the general 
occupational distributions of the census population. The number of 
adults (ages 20-59) finally available amounted to approximately 1,000. 
The result of our own matching is shown in table 10. As will be seen, 
our percentages for the various occupational categories, agree very 
closely, age for age and sex for sex (except for females above 50), with 
those determined for the country as a whole. 

Owing to the predominately urban character of our own population, 
a certain amount of substitution was necessary. For example, we did 
not have among our subjects any person employed in the extraction of 
minerals, and very few who might be classified as being engaged in 
agriculture. It was therefore necessary to find cases, among our own 
occupational groups, • which might be legitimately used in their place. 
A fair way to do this, it seemed to us, was to substitute individuals 
chosen from urban occupational groups which might have a mean in- 
telligence rating of the same order. To achieve this we turned to the 
tables furnished in the Memoirs of the National Academy of Sciences^ 
wherein are listed the mean scores of the various occupational groups 
represented in the U. S. Army draft. Checking on these mean scores, 
we looked for such urban occupations as showed intelligence ratings of 
about the same order as those of the agricultural and other groups which 
we were lacking. Thus we found that barbers, bakers and teamsters 
could be substituted for farmers, and that other missing occupational 
representatives® could be similarly replaced. 

A word as to the so cial s tatus of subjects available for our testing 
program. In general we can say that we included individuals from 
almost every social and economic class to be found in the City of New 
York. Lai^e numbers, however, were taken from special groups, such 
as persons attending Adult Education classes; workers, helpers and 
personnel of various hospitals of the city; members of several social 
organizations, like the “Y’s”; political clubs and church auxiliaries, and, 
in the case of the older people. Homes for the Aged. It should be noted. 

Tables 374-375 and 377, pp. 820-821 and p. 824. 

* Where the numbers employed in any given occupational group were relatively 
very small in the U. S. Census, we did not bother with making the substitution. 
Our general rule was to disregard such occupational groups as showed an incidence 
of only 1 per cent of the total or less. 
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TABLE 10 

Percentage of Bellevue adult sampling in each occupational category — ages ^0~59 
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however, that a substantial percentage of our subjects was obtained by 
visits to private homes. A certain number of the female subjects were 
parents of children brought to the Bellevue Mental Hygiene Clinic. 
A special source worth mentioning was the Coney Island beach, where 
one of our resourceful examiners went daily throughout a summer, and 
obtained volunteer subjects of both sexes. A detailed analysis of the 
sources is given in table 11. 

Subjects chosen by the above procedure included very few individuals 
of mental defective level. It was necessary to supplement our general 
sampling by a certain number of low grade individuals, which would 
roughly correspond to the estimated incidence of defectives in the total 


TABLE 11 

Sources of adult sampling 
Ages 16 to 69 


8017BCE 

NUMBER OF CASES 

Homes of subjects (mostly housewives) 

130 

Political, social and fraternal organizations. 

126 

Commercial establishments ■. 

14 

Federal project workers, hospital workers, including profes- 
sional, clerical, skilled, unskilled and domestic workers 

Hospital wards and clinics (non-mental cases) 

255 

176 

Transient Bureau (all types who happen to be passing through 
the city and are in need of temporary aid) 

67 

Summer resort (Coney Island Beach) 

58 

Adult vocational schools 

234 

Day high schools 

48 

Old age institutions 

73 


Total 

1,181 



population. The percentage of mental defectives in the U. S. has been 
estimated very differently by different investigators,^ but on the basis 
of our own experience, and of the consensus of previous investigators 
we fixed it at 3 per cent. To obtain a random sampling of the defec- 
tives themselves, we made use of patients coming to the Bellevue 
Psychiatric Hospital and of others already committed to various State 
Institutions. Over 200 such cases were available to us and from among 
these we were able to make a random selection at all ages and for both 
sexes. A special analysis of the entire group of mental defectives will 

^See Pintner, R.: * ^Intelligence Testing pp . S36-340. 



POPULATION USED IN STANDAEDIZING TESTS 


113 


appear subsequently in a study devoted to the problem of the better 
classification of mental defectives. 

The sampling of our children was a much simpler task, thanks in a 
good measure to the co-operation and assistance of the Bureau of Refer- 
ence, Research and Statistics, of the Board of Education of the City 
of New York.® Our procedure was as follows: Consulting the Bureau 
of Reference, Research and Statistics, we obtained a list of available 
schools, which in their opinion might be called representative or aver- 


TABLE 12 

Age-grade disirihuiion in 'public schools of New York City 
Per cent of each age in each grade year, ages 7-16 



9 

10 

1 

1 

3 

1 

14 

4 

54 

15 

27 

51 

1 

26 


2 


Ungraded 

First 9 1 

Second 63 12 

Third 28 69 

Fourth 

Fifth. . . 

Sixth 

Seventh 

Eighth 

Ninth — , 

Tenth 

Eleventh. 

Twelfth •. 

Continuation Schools. . 


Total 100 100 100 100 100 100 100 100 100 100 

This table was calculated from the 37th Annual Report of the Superintendent 
of Schools, City of New York, Statistical Section, tables 56, 108, 132, 179, pages 
109, 177, 204, 259. 



age. By representative schools is meant that the children attend- 
ing them were not predominately of one racial or social group, and by 

* We take this occasion to express our gratitude to the various Superintendents 
who gave us permission to test the children in the schools of the City, and for the 
kind co-operation of the Principals of these schools. We wish to mention in 
particular Miss Elizabeth A. Walsh, Director of the Bureau for Children with Re- 
tarded Mental Development, who assisted us in obtaining children of the un- 
graded classes. 
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average that they were schools, the pupils of which, as a group, were 
neither noticeably above nor significantly below the average in intelli- 
gence for the City of New York as a whole. At these schools we tested 
certain fractions of the population, depending upon the facilities and the 
age-grade range of the school. Usually not more than 100 to 150 chil- 
dren, chosen at random, were examined at any single school. At the end 
of a year we had some 1300 children from 6 to 17 years, tested at differ- 
ent times, at different schools® and by different examiners. These New 
York groups were supplemented by some 200 examined in Yonkers and 


TABLE 13 


Age^-Qrade distribution of Bellevue samplings 
Per cent of each, age in each grade year, ages 7-16 

{ AGS IM TXABS 


OBABX'TXiJl 



7 

s 

9 

10 

11 

13 

13 

14 

15 

15 

Urgrarled 



2.0 

3.3 

1.7 

1.7 

■1 

■1 




10.0 

2.0 









Second 

80.0 

26.0 

2.0 








Third ! 

10.0 

62,0 

14.0 

3.3 

1.7 






Fourth, 

10.0 

74.0 


5.0 

1.7 





Fifth 


8.0 


23.3 

5.0 

1.4 

1.4 



Sixth 




21.7 

46.7 

28.3 

7.1 

2.9 



Seventh - 




1.7 

18.3 

31.7 

15.7 

2.9 

2.0 

1.0 

Eighth. I...* 





li 

30.0 

42.9 

17.1 

7.0 

3.0 

Ninth 





IS 

mm 

28.6 

48.6 

21.0 

8,0 

Tenth 





H 

H 

2.9 

18.'6 

42.0 

IgCiHil 

Eleventh 






■ 


2.9 

12.0 

14.0 

Twelfth 

i 





■ 



4.0 

9.0 

Continuation Schools. . 





■ 

■ 


4.3 

■ 

25.0 

Total 

100.0 

8 

O 


100.0 

100.1 

100.1 

100.0 

100.1 

100.0 

100.0 


New Jersey schools, making a total of approximately 1500. As in the 
case of our adult sampling procedure, we placed all these cases in a 
“statistical um,” and from this um, made a random selection to conform 
to our sampling criterion. The basis of this fimal sample was a matching 
of our own population against the age-grade distribution of the City as a 
whole, as derived from the 1934 age-progress analysis of pupils in 
Elementary, Junior and Senior High Schools and various Continuation 

• Of the Schools, 4 were in Manhattan, 3 in Brooklyn, 2 in Queens and 2 in the 
Bronx. 
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Schools of the City of New York.^® This general age-grade distribution 
for the City as a whole is given in table 12; the corresponding figures for 
our own matched sampling, in table 13. 

The cases included in our standardizing children’s sample do not 
represent all the children under 16 who were tested, but only those who 
were ‘^matched” to fit our criterion, that is, the age-grade distribution 
of children in the New York Public Schools. On the other hand, it 
includes a small percentage of individuals (circa 2 per cent) who were 
not in the public schools. These are the lower-grade defectives who 
because of their limited endowment were not to be found in regular 
schools. Our sources for these lower-grade defectives (also some of the 
higher types) were, again, patients up for commitment at the Bellevue 
Mental Hygiene Clinic, or cases already committed to some New York 
State Institutions for the feebleminded.^^ 

Superintendent’s Report, 1934. 

^^Principally Letchworth Village. We wish to thank Dr. Edward J. Hum- 
phreys, Director of Research and Dr. Elaine F. Kinder, Research Psychologist* 
for the co-operation which made these cases available to us. 



CHAPTER 9 

STANDARDIZATION AND RESULTS 


The mtelligence examinations that are presented in this book are point 
scales. This means that an individuaFs intelligence rating is obtained, 
ultimately, from a summation of the credits (or points) -which he is given 
for passing various test items. The first problem which confronts us in 
such a scale is to decide what portion of the total number of credits 
should be assigned to each of the tests. This is the statistical problem 
of “weighting"'. One way of meeting it is to let the test weights take 
care of themselves by simply allowing one point for each item correctly 
passed. Such, for example, is the procedure employed on the Army 
Alpha, where the number of items on the several tests determines the 
final amount that each test contributes to the scale. Another way is 
to use some predetermined scoring system that will fix in advance the 
proportion which each test contributes to this total score, irrespective 
of the number of items it may happen to contain. This, of course, 
requires an additional set of criteria, but has the advantage of com- 
pelling the author to evaluate his tests separately. 

To determine the “value" of a test, however, is an extremely onerous 
procedure. From a statistical point of view the most valid method 
that can be employed seemingly is that of multiple correlations. But 
the results obtained thereby usually do not, in our experience, justify 
the amount of labor involved. In the end, the tests must be evaluated 
in the light of clinical experience. Whether a test is “good" or not 
must be determined eventually by the efficacy with which it measures 
the thing one wants it to measure. 

Now that we have examined some 5000^ subjects, we are in a better 
position to express a competent opinion regarding the merits of our 
individual tests. At the outset of standardization it seemed to us, in 
the absence of any definite knowledge, that the best assumption to 
make about the separate tests of the scale was that they were equally 
important.^ Accordingly each test in our scale was made to contribute 

1 This number includes 2000 cases not used in the standardization. 

* From a qualitative point of view, subsequent statistical analysis has not, as 
indeed was to be expected, fully confirmed this assumption. Some of our tests 
are better measures of intelligence than others. (See on this point comments on 
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the same possible maximal number of points as any other, irrespective 
of the number of items of which it happened to consist. There are a 
number of advantages in this procedure, the most important of which, 
perhaps, is that it allows for the addition or omission of tests from the 
scales without significantly altering the norms. When tests scores are 
expressed in terms of S.D. units and each test is equated to every other, 
any given test may be left out and any new test may be added and 
allowed for (up to a certain point) by simply prorating for it. 

Weighting of the separate tests was performed in the following man- 
ner: Using the age group 20-34 as the standardizing group, we first 
distributed the raw scores of each test separately and calculated the 
usual statistical constants for them. From a comparison of these con- 
stants we then found that a scale of about 20 poiats would best fit our 
data. We accordingly converted the raw scores for each test into such 
a 20 point scale, setting a new mean at 10 and a new standard deviation 
at 3. This conversion was made by emplo 3 dng Hull’s method.® The 
result of the conversion obviously was to equate all the tests against 
each other, so that all had identical means and approximately equal 
ranges of variability. Finally, we drew up a table of equivalents in 
which the original scores on each of the tests were expressed in terms 
of equal multiples or fractions of their new S.D.’s. It is these scores 
which were used in obtaining all our norms, and which must be em- 
ployed to obtain a subject’s final rating on any of the scales. 

We shall now summarize the main results of our standardization. 
The first fact of interest is the rise and fall of total scores on the various 
tests with increasing age. These are given in tables 14-16. As will 
be noted, there is a progressive rise of mean score with age, which con- 
tinues up to about age 18.^ The curve plotted for the Full Scale means 
is given on page 29, and has already been discussed. As might be 
expected, the standard deviations for the lower ages are rather large 
for their mean scores. This indicates that the tests cannot be used 
below a certain age. The critical point seems to be about year 10, and 
accordingly our scales are to be used only from that age onward. 


individual tests.) However, such attempts as we have made -at more refined 
scoring do not seem to affect significantly our total score distributions- This 
experience agrees with that of the standardizers of the Army Alpha. 

* The details of our statistical procedures are given in Appendix 1. 

* Actually, age 22. The difference in mean score, however, between age 18 
and 22 is negligible. The scores compared are the smoothed and not the calcu- 
lated means. 
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The mean scores, even at an early age, however, furnish an oppor- 
tunity for calculating equivalent mental ages, and although we have 
abandoned the M.A. as a base for calculating LQ/s, these equivalents 
are of some interest. They are accordingly given in table 17 for our 
three major scales. An important point to note is that there are no 
mental age equivalents for ages above 15|, and that beginning as early 

TABLE 14 

Summary of mean standard scores and sigmas 
Ages 7 to 59 


Full scale, 10 tests 


AGB 

CALCX7Iu».TED MEAN 

CALCULATED SIOMA 

SMOOTHED MEAN 

SMOOTHED 8IQMA 

7.5 

27.2* 

11.55 

23.0 

13.0 

8.5 i 

34.8 

13.78 

34.5 

14.8 

9.5 

46.8 

16.82 

48.6 i 

16.0 

10.5 

63.3 

15.72 

62.3 

16.8 

11.5 

73.8 

17.43 

73.2 

17.4 

12.5 

82.2 

17.89 

82,6 

17.8 

13.5 

89.9 

^ 17.94 

89.8 

18.0 

14.5 

93.3 

18.67 

93.4 

18.3 

15.5 

95.1 

18.00 

95.0 

•18.8 

16.5 

95.9 

19.14 

96.2 

19.0 

18.5 

96.3 

19.88 

97-8 

19.6 

22.5 

98.8 

19.00 

97,9 

; 20.8 

27.5 

95.9 

21.35 

95.0 

I 21.9 

32.5 

90.4 

24.00 

91.6 

22.5 

37.5 

86.7 

24.00 

88.0 

23.4 

42.5 

85.1 

23.90 

84.8 

23.9 

47.5 

79.0 

22.20 

81.3 

24.0 

52.5 

77.4 

23.73 

78.0 

24.2 

57.5 

74.9 

24.77 

74.8 

24.5 


*The mean score for age 7 is obviously too high owing to the necessary omis- 
sion of individuals in this age group of low defective intelligence. The Bellevue 
Scale does not extend sufficiently far down to permit their being tested with it. 


as age 14, the differences between succeeding half year scores are so 
small as to make them unreliable. This would likewise be true of most 
other general intelligence examinations if the original raw scores were 
similarly transmuted into S.D.^ units. 

A second point of interest is the change in test score variability with 
age. This change is best revealed by the ratio of the mean to the standard 
deviation, that is, the value of the coefficient of variation at each age. 



TABLE 15 

Summary of mean standard scores and sigmas 
Ages 7 to 59 


Verbal scale, 5 tests 


AQB 

CAICtriATBD MSAK 

SIGMA 

BMOOTHSD MSAK 

SMOOTHZD SIGMA 

7.5 

11.2 

4.92 

9-1 

6.1 

8.5 

16.0 

7.70 

15.6 

7.5 

9.5 

22.1 

8.77 

22.1 

8.8 

10.5 

30.08 

8.78 

30.0 

9.4 

11.5 

36.50 

10.75 

36-4 

9.9 

12.5 

40.66 

10.29 

40.9 

10.1 

13.5 

43.5 

9.95 

43.5 

10.2 

14.5 

45.07 

10.17 

45.0 

10.4 

15-5 

45.6 

9-90 

45.5 

11.0 

16.5 

46.25 

10.33 

46.2. 

11.2 

18.5 

45.8 

11.69 

46.8 

11.5 

22.5 

48.6 

10.33 

47.0 

11.9 

27.5 

48.14 

12.36 

47.0 

12.4 

32.5 

44.96 

13.90 

46.5 

12.5 

37.5 

45.1 

13.31 

45.5 

13,0 

42-5 

45.1 

12.87 

44.5 

13.4 

47.5 

43.8 

13.15 

43.5 

13.6 

52.5 

41-85 

13.75 

42.2 

14.0 

57.5 

40.6 

14.14 

41.0 

14.5 


TABLE 16 

Summary of mean standard scores and^sigmas 
Ages 7 to 49 


Performance scale, 5 tests 


AGB 

CAI^CXIIATED MEAN 

CAU;UIATBX> SIGMA 

.. 

SMOOTHBD MSAK 

SMOOrmSD SIGMA 

7.5 

15-6 

8.49 

13.7 

8.3 

8.5 

20.5 

8.0 

19.6 

9.0 

9.5 

24.9 

9.5 

25.8 

9.4 

10.5 

34.0 

9-0 

31.6 

9.7 

11.5 

37.42 

8.25 

37.1 

9.9 

12.5 

41.83 

9.42 

42.4 

9.9‘ 

13.5 

46.86 

lO.OS- 

46.2 

10.0 

14.5 

48.35 

9.57 

48.3 

10.0 

15.5 

49.55 

10.75 

49.5 

10.1 

16.5 

50-7 

10.69 

50.7 

10.4 

18.5 

50.95 

10.29 

51.5 

10.5 

22.5' 

50.88 

10.23 

50.8 

10.9 

27.5 

48.1 

11.1 

48.3 

11-2 

32.5 

45.45 

12.58 

45.5 

11.6 

37.5 

42.91 

12.17 

42.6 

12.0 

42.5 

40.1 

12.19 

39.8 

;i2.6 

47.5 

36.6 

12.54 

36.8 

12.8 
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TABLE 17 


Equivalent* mental ages for Bellevue Intelligence Scales 


M.A 

imLSCAUS 

PSBFOBMAJICSS 

ecAUs 

TSBBAli 8CALB 

M . A . 

7—0 

22 

15 

10 

7—0 

7—3 

24 

16 

11 

7—3 

7-6 

26 

17 

12 

7-6 

7-9 

29 



7-0 

8-0 

31 

18 

14 

8-0 

8-3 

34 

19 

15 

8—3 

8-6 

37 

20 

16 

8—6 

8-9 

40 

22 

17 

8-0 

9-0 

43 

23 

19 

9-0 

9-3 

46 

24 

21 

9-3 

9-6 

49 

26 

22 

9-6 

9—9 

52 

27 

24 

9-0 

10-0 

56 

28 

26 

10-0 

10 — ^3 

59 

30 

28 

10-3 

10-6 

62 

31 

30 

10-6 

10-9 

65 

33 

32 

10—9 

11-0 

68 

34 

33 

11-0 

11—3 

71 

36 

35 

11—3 

11-6 

73 

37 

36 

11—6 

11—9 

76 

39 

38 

11—9 

12-0 

78 

40 

39 

12-0 

12—3 

80 

41 

40 

12-3 

12-6 

82 

42 

41 

12-6 

12-9 

85 

44 

42 

12—9 

13-0 

86 

45 

43 

13-0 

13-3 

88 



13—3 

13—6 

90 

46 

44 

13—6 

13—9 

91 



13—9 

14-0 

92 

47 


14-0 

14—6 

93 

48 

45 

14 r -6 

15-0 

94 

49 


15-0 

15-6 

95 

50 

46 

15-6 


* These M.A. equivalents may not be used to calculate I.Q/s. 
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These values (Pearson’s C.’s of V.) are given in table 18 . Our results 
show that in children there is a decline in variability and in adults an in- 
crease in variability with age. These results are in line with what has 
been found in other studies,^ but interpretation of them is not easy. As 
regards the decrease of variability with age in the case of children, one 
possible explanation is that it is due to the leveling influence of educa- 
tion® and environment. Another possible cause is the factor of natural 
selection. If for no other reason than the effect of death in cutting 


TABLE 18 


Coefficients of variation 



Weighted scores (smoothed) 


Ages 10-60 


▲GB 

rOLL SCALE 

1 VERBAL SCALE 

PERFOBMANCB SCALE 

10 

26.97 

31.33 

30.70 

11 

23.77 

27.20 

26.68 

12 

21.55 

24.69 

23.35 

13 

20,04 

23.45 

21.64 

14 

19.59 

23.11 

20.70 

15 

19.79 

24.18 

20.40 

16 

19.75 

24.22 

20.51 

17-19 

20.04 

24.57 

20.39 

20-24 

21.25 

25.32 

21.46 

25-29 

23.05 

26.38 

23.19 

30-34 

24.56 

26.88 

25.49 

35-39 

26.59 ’ 

28.57 

28.17 

40-14 

28.18 1 

30.11 

31.41 

45-49 

29.52 

31.26 

34.78 

50-54 

31.03 

33.18 


55-59 

32.75 

35.37 



down extreme deviates, one ought to expect variability to decrease with 
age. But this factor should also operate in old age. 

The fact that actual measures do not confirm this expectation, sug- 
gests other possible influencing factors. One is the lack of equivalence 

* As for example those of Ruger, H. A. and Stoessiger, B.: On the Growth of 
Certain Characteristics in Man. Annals of Eugenics, 1926, parts I II. 

• See on this point Wechsler, D.: The Influence of Education on Intelligence. 
Jr, Educ, Psychol, 1926, XVII, pp. 218-257. 
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TABLE 19 

Bellevue intelligence quotients 

Means and standard deviations of Full-Scale intelligence quotients 


JLGB 

KUMSZB 

10 

60 

11 

60 

12 

60 

13 

70 

14 

70 

16 

100 

16 

100 

17-19 

100 

20-24 

160 

25-29 

195 

30-34 

140 

35-39 

136 

40-44 

91 

45-49 

70 

50-54 

56 

55^9 

60 

50-59 

105 


MBAN I.Q, 

S.D. 

101.25 

13.20 

100.84 

14.10 

100.08 

13.80 

100.67 

14.70 

99.93 

14.75 

100.00 

14.57 

100.30 

15.15 

98.75 

14.60 

100.16 

13.70 

100.89 

14.60 

99.57 

15.60 

99.76 

15.50 

100.30 

14.80 

100.07 

14.01 

100.50 

13.97 

99.1 

16.85 

99.84 

15.26 


TABLE 20 


Bellevue intelligence quotients 
Means and standard deviations of Verbal 




ZrUUBSA 


lOSAH LQ. 


10 

11 

12 

13 

14 

15 

16 

17-19 

20-24 

25-29 

30-34 

35-39 

40-44 

45^9 

50-54 

55-59 


60 

60 

60 

70 

70 

100 

100 

100 

160 

195 

140 

135 

91 

70 

55 

50 


100.08 

100.25 

100.50 

100.07 

100.14 

100.00 

99.95 

98.45 

102.09 

101.68 

98.82 

99.98 

100.62 

100.64 

100.05 

98.00 


60-69 


105 


99.07 


I.Q.^s 


S.D. 


14.08 

16.65 

15.44 

14.00 
14.28 
14.60 
13.80 
14.70 

13.22 

14.22 

16.01 
15.18 
14.86 

13.84 
13.95 

14.85 

14.43 
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between physiological and chronological age. Some individuals grow 
^^ol d^^ at .^-very much earlier age than others. Other factors are the 
mixed influences of education, vocational pursuit in a particular field 
over a prolonged time, and restricted environment. A fuller discussion 
of this question, however, would take us too far afield and we merely 
present it as a point for study and further investigation. 

More important for our purpose than the variations of absolute scores 
with age is to discover the corresponding change in I.Q. when calculated 
in the manner described in chapter 4. In tables 19-21 we give the 
mean I.Q.’s and their S.D.’s for all ages — ^from 10 to 60 — on our three 


TABLE 21 

Bellevue intelligence quotients 


Means and standard deviation of Performance LQ.'s 


AGB 

KtrMBEB 

USAK I.Q. 

aD. 

10 

60 

102.50 

13.57 

11 

60 

101.50 

12.60 

12 

60 

100.17 

13.42 

13 

70 

101.07 

14.62 

14 

70 

100.50 

14.19 

15 

100 

100.05 

15.70 

16 

100 

99.75 

15.15 

17-19 

100 

99.10 

14.30 

20-24 

160 

99.00 

14.37 

25-29 i 

195 

100.22 1 

14.31 

30-34 

140 

100.00 

15.78 

35-39 

135 

99.13 

16.21 

40-44 

91 

100.47 

15.04 

45-49 

70 

100.64 

13.10 


major scales. The corresponding coefiScients of variation for each scale 
are given in table 22. As will be noted, there is remarkable constancy 
of the I.Q. throughout. Not only the means, but also the S.D.’s remain 
approximately the same throughout the entire age range for which tlje 
scales are employed. It thus appears that our method of calculating 
I.Q.’s not only takes into account the change of test score with age, 
but that of the differences of variability of the tests scored at different 
ages. It should be noted that the mean I.Q. remains constant, not 
only in the Full Scale, but also in the case of the Verbal and Performance 
Scales. This may be mterpreted as justification for our inclusion of 
both Verbal and Performance items in a single scale. More direct 
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evidence on this question, however, may be had from a study of the 
mutual intercorrelations of the several parts when treated as separate 
scales. The correlations between them were as follows: 

Verbal I.Q. X Full Scale I.Q., r = .90 ± .007 
Performance I.Q. X Full Scale LQ., r = .88 zb .008 
Verbal LQ. X Performance I.Q., r = .71 dz .018 
When test scores were used instead of I.Q.’s, the correlations were 
practically identical, thus: 


TABLE 22 

Coefficients of variation ( ^ X 100 
\M 

Intelligence quotients 


Ages 10-60 


AG2 

FULLBCAliB 

TEBBAL SCALE 

PEHFORMJLNCE SCALE 

10 

13.04 

14.07 

13.24 

11 

13.98 

16.61 

12.41 

12 

13.79 

15.36 

13.40 

13 

14.62 

13.99 

14.47 

14 

14.76 

14.26 

14.12 

15 

14.57 

14.60 

15.69 

16 

15.10 

13.81 

15.19 

17-19 

14.68 

14.93 

14.43 

20-24 

13.68 

12.95 

14.52 

25-29 

14.47 

14.00 

14.28 

30-34 

I 15.67 

16.20 

15.78 

35-39 

1 15.54 

15.18 

16.35 

40-44 

14.76 

14.78 

14.97 

45^9 

I 14.00 

13.75 

13.02 

50-59 

15.28 

14.57 



Verbal Test Score X Full Scale Test Score, r = .91 zb .007 
Performance Test Score X Full Scale Test Score, r = .90 ± .007 
Verbal Test Score X Performance Test Score, r = .67 zb .020 
IVhen corrected for attenuation the correlations between Performance 
and Verbal increase significantly, thus: 

Verbal LQ. X Performance I.Q., r = .83 
Verbal Score X Performance Score, r = .81 
Because of the high^ correlation between our Performance and Verbal 
I.Q.’s it might be supposed that the two scales could be regarded as 

^ The high correlation obtained is due in part to the heterogeneity of our popu- 
lation as regards both age and I.Q. range. Other studies show considerably 
lower correlations between the Verbal and Performance I.Q.’s. 
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equivalent, but that is not necessarily the case. A high correlation 
between two tests signifies only that the tests measure the same trait 
or ability; it does not mean that they measure them to the same degree. 
It is possible for two tests to be perfectly correlated and still give far 
from equivalent ratings. This would happen if the individual measures 
entering into the correlation showed systematic constant differences. 
Thus, if I.Q.^s furnished by one scale were systematically 10 points 
lower than I.Q.’s obtained on a second scale, then the same individual 
tested with the two scales would obtain different I.Q.'s in spite of the 
fact that the two scales were perfectly correlated. If he obtained an 
I.Q, of 95 on scale A, his corresponding I.Q. on B would be only 85, or 
if 115 on B, an I.Q. of 125 on A, and so on. 

In evaluating the equivalence of two scales where results are expressed 
in the same units, one must take into account not only the degree to 
which the scales correlate with one another, but the absolute amounts 

TABLE 23 


Analysis of differences between verbal and performance l.Qfs 
Median difff^rences and interquartile (Q) ranges for undifferentiated 

groups 


AGES 

MEDIAN DIFFERENCE 

Q 

NOUBEB 

— 

10-16 

9.1 

5.3 

520 

20-49 

8.4 

4.9 

590 


by which the individual measures differ. In practice it is best effected 
by averaging the observed differences without respect to sign. The 
median difference between Performance and Verbal I.Q.’s for our Scale 
is approximately 8.7 points. It varies with age and with the levels of 
I.Q. compared. The basic data from which these factors may be 
appraised are given in tables 23, 24 and 25. Discussion of them is left 
for our next chapter when we shall again have occasion to take up the 
problem of Verbal and Performance I.Q. 

At this point we may just mention the fact that while the I.Q. differ- 
ences between the Bellevue Verbal and Performance Scales are seem- 
ingly large, they are probably considerably smaller than would be 
obtained from a similar study between the Binet and any other Per- 
formance Scale now in use. We say probably, because we have been 
imable to find such studies in the literature covering any considerable 
number of cases. The best material available for such a study is that 
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furnished by Grace Arthur in the standardization of the Scale bearing 
her name,^ where correlation tables between Performance and Verbal 
(Binet) tests are given. From these tables we were able to deduce the 
approximate I.Q. differences between scores on Performance and Verbal 
for some 185 cases between ages 10-15 and found the median to be 11.0 
points with an interquartile range of 5.84. 

The usefulness of a scale depends not only upon the validity of its 
measures, but also upon the manner in which these measures distribute 
themselves within the limits of their range. In general it is desirable 
that the range be as wide as possible, that the measures be continuous, 


TABLE 24 

Median differences and interquartile ranges for different I.Q. categories 


AGES 10-16 

AGES 20-49 

I.Q. category 

Median 

differ- 

ence 

Q 

Number 

I .Q. category 

Median 

differ- 

ence 

Q 

Number 

79 and below 


4.8 

■i 

79 and below 


mm 

5.5 

74 

90-110 


4.6 


91-110 



4.6 

405 

120 and over 


7.8 


120 and over 


11.1 

4.7 

62 


TABLE 25 

Per cent of cases with verbal higher and performance higher 


1 

AGES 10-19 

AGES 20-49 

I.Q. category 

Perform- 

ance 

higher 

Verbal 

higher 

[ No 
difference 

I.Q. category 

1 

Perform- 

ance 

higher 

Verbal 

higher 

No 

difference 

79 and below 

67.5 

25.0 

7.5 

75 and below 

74.3 

23.0 

2.7 

91-110 

52.3 

43.9 ! 

3.8 

91-110 

46.7 

48.9 

4.4 

120 and over 

12.2 

82.9 

4.9 

120 and over 

21.0 

77.4 

1.6 


and that there be no piling up of scores at any point. Some authors 
also believe that the resulting frequency curve ought to be Gaussian or 
as nearly Gaussian as possible. The last requirement seems to be a 
result of the wide-spread but mistaken belief that mental measures 
distribute themselves according to the normal curve of error. 

The distribution of our adult I.Q.'s given in figure 9 conforms to the 
first three criteria just mentioned: (1) The range of scores is approxi- 
mately 8 S.D. (or in terms of I.Q., from I.Q. of circa 35 to an I.Q. of 

•Arthur, Grace: A Point Scale of Performance Teets, N. Y., 1933, vol. II, 
pp. 58-61. 
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circa 155®). (2) Our I.Q.’s are continuous within the defined limits. 

All intervening I.Q/s are not only possible, but actually occur. (3) 
There is no piling up of scores at any point of the scale, and particularly 
not at the lower extremes (as in the case of the Army Alpha). The 
distribution of our measures, however, is not Gaussian. As can be 
seen from the figure 9, the histogram is considerably skewed in a nega- 
tive direction. A curve fitted to this data would be of the form of 
Pearson’s Type IV. We have not gone to the trouble of actually fitting 

DISTRIBUTION OF FULL SCALE INTELLIGENCE QUOTIENTS 
AGES 10 TO 60 1508 CASES 



it but have calculated the necessary constants from which this may be 
done. These are given in the footnote below.^° 

We come at last to the most trying problem, that of validation. How 
do we know that our tests are ‘‘good” measures of intelligence? The 
only honest reply we can make is that our own experience has shown 
them to be so. If this seems to be a very tenuous answer we need only 
remind the reader that it has been practical experience which has given 

* The actual I.Q. range for any given age is a function of the age at which it is 
calculated. It varies from age to age. 

10 Mean = 100.11 pi — 0.2789 Ki = 0.3973 a /nS/Si = 3.54 

S.D. = 14.69 fit = 3.6170 K, = 0.5668 VNift = 14.00 

N = 1508 
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(or denied) final validity to every other intelligence test. Regrettable 
as it may seem, empirical judgments, here as elsewhere, play the r 61 e 
of ultimate arbiter.^^ In any case, all evidence for the validity of a 
test, whether statistical or otherwise, is inevitably of an indirect sort 
and, in the end, cumulative rather than decisive. 

The usual procedure for validating a new test has been to set as a 
criterion some well established test which has been accepted as a ‘^good” 
measure of the trait in question and then appraise the validity of the 
new one on the basis of the degree to which it correlates with the already 
established test. The significance of this correlation will depend en- 
tirely upon the original criterion, and it is therefore the criterion itself, 
rather than the new test, which needs examination. In practice the 
general tendency has been to accept tests already in use as being more 
or less established measures of the traits in question, but for the most 
part these criteria themselves have never been validated. The situation 
in the case of intelligence tests is not so bad as in other fields of testing, 
but even here the absence of validated criteria imposes serious limita- 
tions on the conclusion that the tests really measure intelligence. The 
various revisions of the Binet Scales, which, in spite of all criticism, we 
accept as among our best single measures of intelligence, are no excep- 
tions. Their final validation rests primarily on the fact that they have 
worked well in practice, and not because of any compelling statistical 
correlations. 

As evidence we need only recall some of the early arguments in their 
favor. Thus, one of the reasons which Binet gave for devising his test 
was that teachers could not very well be trusted to estimate the true 
intelligence of their pupils. But several years later Terman and others 
came along and gave as one of the proofs of the excellence of their re- 
visions of the scale the fact that I.Q. ratings correlated very highly 
with school progress. In other words, teachers’ judgments which were 
first condemned were now used as supporting criteria for the validity 
of the tests. This obviously begs the question, but the dilemma in- 
volved is almost inevitable. 

The reader should not interpret this remark as an attempt on the part of the 
author to belittle either the importance or value of proper statistical methods. 
On the contrary, he is of the opinion that without intelligent and thorough sta- 
tistical evaluation of one's data, little trust can be placed in one's results. The 
extensive statistical work done on the Bellevue Scales is perhaps the best evidence 
for that. But he is of the opinion that the task of validating a scale does not end 
but only begins with the statistical evaluation. 
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The trouble is not with the teachers’ ratings. The same circularity 
is involved when ratings of any judges are set up as a criterion. In the 
case of the Alpha Tests, the estimates used at the beginning were those 
of Army ofl&cers; in the case of certain tests offered to industry, the 
estimates were of supervisors and executives. But these estimates 
were no more valid than those of the teachers. The tests were urged 
upon the Army and industry because it was pointed out that the esti- 
mates of officers and personnel managem could not always be relied on; 
then the tests were “sold” to them because they correlated well with 
practical judgments. 

From what we have just said, it appears that every new intelligence 
test, unless it is merely intended as an alternate for an old one, must 
itself be validated anew. Such validation is especially necessary if, as 
in our case, we start with the view that the generally accepted criterion 
is not as valid as supposed. The Bellevue Scales were devised because 
of the belief that the Binet Scales were not sufficiently “good” measures 
of intelligence for adults. Otherwise, indeed, we should not have gone 
to the trouble of devising our tests. But we do not thereby deny the 
desirability of showing a correlation between them and other tests of 
intelligence. 

No new test can be markedly out of line with established measures of 
intelligence and still claim to be “good” measures of it, because that 
would be tantamount to saying that all other tests were not reliable 
measures of intelligence. But the degree to which any new test corre- 
lates with established tests (e.g., the Binet) cannot in and of itself be 
accepted as a basic proof of the new test’s validity. The test must be 
able to stand on its own feet. Its correlation with established measures 
of intelb'gence is only a minimum requirement. To meet this minimum 
requirement, a correlational study between Bellevue I.Q.’s and I.Q.’s 
obtained by the Stanford-Binet Test was made. The correlation for 
75 cases (ages 14-16) was .82 db .026. This is about as high as most 
other accepted tests of intelligence correlate with each other, and higher 
than a good many.^^ The results show that the Bellevue Scales measure 
about the same “thing” as the Binet, but of course leave unanswered 
the question as to which of them measures it better. 

For other measures of validity, we made a number of separate corre- 
lation studies between test scores and judges’ ratings. One of these 

For correlations between the Wechsler-Bellevue and other Beales see table 
28, p. 134. 
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studies was between teachers’ estimates and Bellevue Intelligence 
scores or LQ.’s. The subjects for this study came primarily from our 
adolescent groups. One group consisted of students of a Trade School 
in the City of New York; another, of students from the General Com- 
mercial High School in Yonkers. An effort was made to have at least 
two ratings by different teachers for each subject. These ratings were 
averaged, distributed on a 6 point scale, and correlated against test 
scores by means of a four-fold contingency table. The correlation co- 
efficients (Bellevue I.Q.’s against teachers’ estimates) for the two groups 
were as follows: 

Teachers’ Ratings (General H. S.) X Bellevue I.Q.’s, C = .43. 

Teachers’ Ratings (Trade H. S.) X Bellevue I.Q.’s, C = .52.^^ 

Our last method of validation was through comparisons of subjects’ 
scores on the Bellevue Scales with estimates of their intelligence by 
psychiatrists, with records of their school history and with ratings 
obtained from other tests. These comparisons show that the Bellevue 
Scales not only provide reliable measures of intelligence when evaluated 
against clinical data, but do so more consistently than any other single 
scale. For example, in comparing the Stanford-Binet and the Belle- 
vue Scales as regards degree of agreement with psychiatrist’s diagnostic 
ratings, we found that the forecasting efficiency of the Bellevue Scale 
was more than five times that of the Stanford-Binet.^^ The psychia- 
trist’s ratings consisted of the diagnoses Recommended’ or ‘not-recom- 
mended’ for commitment to a state institution for mental defectives; 
the test ratings were the I.Q.’s obtained by the diagnosed subjects on 
the two scales. The resulting correlations (bi-serial r’s) were as follows: 

Binet I.Q.’s X Psychiatrist’s recommendation, r = .33 zh .071 

BeUevue I.Q.’s X Psychiatrist’s recommendation, r = .79 dz .048 

But most of the ‘Correlations” cannot easily be put into statis- 
tical form. They constitute a type of evidence which can best be 
put forward by citation of individual cases. We have many such 
cases at hand, but to cite any considerable number of them in full would 
require more space than we can give here. The following are only by 
way of illustration: 

Case Z). W, Male, age 39, bom in Poland, has lived 22 years in this country. 
Occupation: window cleaner. Referred by Court on charge of disorderly 

« General H. S., N = 45; Trade H. S., N = 74. 

The Relative Effectiveness of the Stanford-Binet and the Bellevue ScaUt etc,, 
by B. Balinsky, H. Israel and D. Wechsler. Amer. Jr. of Orthopsy., 1939, IX, 
798-801. 
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conduct. Chronic alcoholic, also suffering from sj^Dhilis. Examined on 
9/29/36 by a staff psychologist with the following results: 

Stanford-Binet, 8 yrs. 4 mos. I.Q. (on 16) 52 
Pintner-Paterson, M.A. 6-6 

The examining psychologist’s remarks are as follows: ^^One cannot rate the 
patient on the basis of test scores because the results of the psychometric test 
are at variance with climcal impressions. The patient is definitely inferior but 
in the examiner’s opinion not of defective intelligence.” 

Re-examined on 10/1/36 with the Bellevue Intelligence Tests, the patient 
attained a score which gave him an I.Q. of 68 and a rating of borderline intel- 
ligence. This corresponded to both the psychologist’s and the psychiatrist’s 
estimate of the patient. It was clear also from the man’s social history that 
he could not have been of so low an intelligence level as the Binet and Pintner- 
Paterson Scale made him out to be. In spite of his alcoholism and his syphilis 
he had managed, until recently, to get along tolerably well in the community 
and had worked fairly consistently until two years ago when, as the result of 
a fall, he had to be hospitalized. The subject obviously needed treatment both 
for his syphilis and alcoholism, but was not a suitable candidate for admission 
to an institution for the mentally defective. 

Case S, A. Female, age 28, born in U. S. Occupation: nursemaid. 
Charged with prostitution and referred by Court for observation. Two years 
prior to present admission had been seen in Bellevue Mental H 3 ’’giene Clinic. 
At that time (1936) was examined with the Stanford-Binet and attained an 
I.Q. of 86. During present period of observation (October 1937) she was first 
retested with the Binet and then given the Bellevue Adult tests, with following 
results: 

Stanford-Binet, I.Q. (on 15) 86 
Bellevue Adult, I.Q. 68 

Abstract of examiner^ s commerds: Because of the apparent discrepancy be- 
tween the clinical impression and the relative competence which the patient 
showed on the Binet tests, she was given a Bellevue Adult Examination which 
contains performance as well as verbal tests. Her scores on the separate parts 
of the Bellevue showed marked discrepancy, being relatively good on the verbal 
and markedly inferior on performance tests, but the final score which she 
attained on the Bellevue Full Scale gave a more appropriate index of her men- 
tality than that obtained from either the Bellevue Verbal tests or the Stanford- 
Binet alone. The performance part of the Bellevue Test revealed the patient’s 
vocational inaptitude, her defective motility and her marked inability to 
adjust to unfamiliar patterns. Altogether she functions as a borderline de- 
fective meriting commitment. No evidence of psychotic behavior was noted. 

Case M, 0. Male, age 15 years, 9 months, bom in U. S.; school placement: 
ungraded class. He was brought to the hospital by the probation officer of 
the Children’s Court because of persistent truancy. He was a neglected child 
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without proper supervision. In 1927 the boy was in another city court and 
was remanded to the hospital as a possible mental defective. Returned to 
court with diagnosis, ‘‘not defective.” 

Bellevue Intelligence Examination (2/27/36): Score 96, I.Q.: 101, Rating: 
average intelligence. 

Previous to this examination he attained I.Q.’s on the Binet ranging from 
72 to 86. On recent Binet test (given 2/24/36) he made an M.A. of 13 — 7 and 
an I.Q. of 86. The psychologist’s interpretation, notwithstanding the Stan- 
ford-Binet I.Q. of only 86, was: average intelligence. On the same day he was 
given a number of educational tests with the following results: 

Woody McCall Arithmetic: 4th grade 
Monroe Silent Reading: 8th grade 
Monroe Written Spelling: 3rd grade 

The child had been in the ungraded classes during the greater part of his 
school life. He resented being there and beheved that he could do the work 
of the normal classes. In spite of his present school placement both the psy- 
chologist and the psychiatrist felt that this boy had good native endowment 
and did not belong in the ungraded class. The BeUevue tests showed that he 
was of average intelligence and that his educational disabilities could not be 
ascribed to mental retardation. 

Case P. Q. Female, age 30, bom in U. S. Occupation: registered nurse 
and social worker. Education: two years of college. 

Revised Stanford-Binet (Form L), I.Q. 140 
Bellevue Adult, I.Q. 117 

Examiner^ s comments: This subject is employed in one of the mental hy- 
giene clinics and known to both the examiner and psychiatrist who have had 
daily association with her. Our general clinical impression and that of the 
psychiatrist would be that the Bellewe Adult I.Q. of 117^^ is a more than 
generous estimate. The rating of high average is more in line with the general 
impression. 

Case M, B, Female, age 40, born in Italy, lived in U. S. 25 years. Occu- 
pation: housewife. Education: 3rd grade. 

Revised Stanford-Binet (Form L), I.Q. 58 
Bellevue Adult, I.Q. 71 
Diagnosis: borderline intelligence 

Examiner^s comments: Stanford-Binet I.Q. much too low. She worked 
from the time she was 12 until she was 30. The psychiatrist’s impression is 
that the patient is of inferior intelligence but not as low as the Stanford-Binet 
would indicate. 

“ An I.Q. of 117 on the Bellevue rates Bright Normal. 
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RELIABILITY 

The reliability of a scale is the consistency with which it measures 
the abilities it sets out to test. It is concerned with two questions: 
(1) The limits between which any actual score may fall. (2) The 
probability that an individual given the same tests on another occa- 
sion will make a like score. As criteria of a scale's rehability, statis- 
ticians generally use one or several of the following measures: (1) the 
standard error of measurement, (2) the degree of correlation between 
the various portions of the scale, (3) the correlation between alternate 
forms of the same scale, (4) correlations between repeated administra- 

TABLE 26 


CoTrelaiion between various 'portions of the scale 



AOS3 

N 

r* 

P.E. 

4 tests X 4 testsf 

20-34 

355 

.90 

±.014 

Verbal I.Q.'s X Performance I.Q/s 

20-34 

355 

.83 

±.018 


* Corrected for attenuation. 

t The two groups consisted of Information, Digit Span, Picture Completion, 
and Block Design against Comprehension, Arithmetic, Picture Arrangement and 
Digit Symbol. 


TABLE 27 

Correlation between retests* 



AGE3 

N 

BHO 

P.E. 

Children 

10-13 

32 

.94 

±.013 

Adults 

20-34 

20 

.94 

±.018 


* Given at intervals of one month to one year. 


tions of the tests to the same individuals. We have data regarding 
three of the above four. The standard error of measurement (criec) for 
the full Scale (I.Q.'s) whose mean and S.D. are respectively 100.11 and 
14.69, is 5.674. The correlation data are given in tables 26 and 27. 

CORRELATIONS WITH OTHER SCALES 

As a final set of data of general interest, we have brought together a 
series of reported correlations between the W.-B. and other scales. These 
are given in table 28. Examination of table 28 shows that the correla- 
tion of the Wechsler-Bellevue with other standard intelligence tests is 
generally high. The correlations given in the table are all with the Full 
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The data are those of Goldfarb (9) ; the correlations were prepared for the author by Professor Lorge. 
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scale. Many of the cited studies also report separate correlations with 
the Verbal and Performance parts of the examination. Of these, the 
correlations reported for the Verbal part alone are usually almost as 
high and occasionally somewhat higher than those for the Full scale. 
Thus, Verbal X Stanford-Binet 1916, adolescents (2),* r = .87; X Stan- 
ford-Binet 1937 L, adult mental patients (10), r = .92; X Stanford- 
Binet 1916, college freshmen, r = .53; X Henmon-Nelson, adolescents, 
.80; X Morgan Mental Ability, college freshmen, r = .48; X C.A.V.D., 
adult males, r = .67. On the other hand, correlations of the Perform- 
ance part of the W.-B. with other scales are systematically lower than 
that of the Full scale, and often markedly so. The corresponding 
Performance correlations for the groups just enumerated are .56, .73, 
.34; and .58, .34 and .44 respectively. In general. Performance correla- 
tions are better at lower than at higher intelligence levels, better for 
adolescents than adults, and poorest for college students. 


*The numbers in the parentheses are to the references cited in table 28 . 



CHAPTER 10 

LIMITATIONS AND SPECIAL MERITS 


In this chapter we propose to discuss the limitations and merits of 
the Wechsler-Bellevue Scale in the light of our experience with it as a 
clinical instrument. The scale has now been employed at the Bellevue 
Psychiatric Hospital, the Court of General Sessions of New York City 
and the Queen’s General Hospital, for more than five years, during w’hich 
time it may be said to have been under continued comparison \vith other 
standard scales. Since its publication, the scale has furthermore had 
wide trial in many sections of the country, recently among some of our 
armed forces; we have also had the benefits both of published reports and 
personal communications regarding its shortcomings and advantages. 
The following remarks are intended to summarize these findings and 
reactions as well as our own further experiences with the scale. 

One of the first criticisms brought forward b}" examiners pertained 
to the differences in I.Q. between the Verbal and Performance parts of 
the Bellevue Scales when I.Q.’s were calculated separately on each part 
of the examination. Some of the examiners looked upon this fact as an 
advantage and others as a limitation. But from either point of view 
it again brought up the problem of the legitimacy of combining Per- 
formance and Verbal items into a single scale. The cases that came up 
for discussion were naturally those which showed large differences 
between Performance and Verbal I.Q.’s (10 to 15 points or more). Often 
they were subjects with special psychomotor or foreign language 
handicaps, but among them were many for whom no special defects 
could be posited. As these subjects were often hospital or clinic 
patients, the first question that suggested itself was how the matter stood 
when the scores of “normal’’ subjects were compared. Accordingly, 
we made a study of I.Q. differences obtained betwreen the Verbal parts 
of our scale among various samples of our standardizing population. 
The results of this study have already been given (table 23, p. 125). 
The data comprises 590 cases between ages 20 to 49 and 520 cases 
between ages 10 to 16. The mean I.Q. differences between Verbal 
and Performance rating of the two groups were 8.4 and 9.1 I.Q. points, 
respectively. It should be noted that these differences could not have 
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been predicted from the degree of correlation between the two test 
batteries (.83), nor from their mean LQ. values at any age. At no 
age were the mean Performance and Verbal I,Q.’?..more than one or 
two points apart.^ 

The question arises what an examiner should do when he finds a 
large discrepancy between Verbal and Non-verbal I.Q.’s. Three solu- 
tions are possible. One is to give preference to one or the other of the 
test results; another, to use the higher of the ratings obtained; and the 
third, to resort to some method of averaging them. The first of these 
methods is seemingly the one most generally favored by psychologists 
when results are obtained vith separate scales. Most psychologists 
when getting a large discrepancy between, say the Pintner-Paterson 
and the Binet, generally accept the latter. If a subject obtains an 
I.Q. of 85 on the Binet and 110 on the Pintner-Paterson Performance 
Tests, a very common rating given by psychologists is ^^Dull-Normal 
Intelligence with Special Manual Ability.” . . . This practice seems to 
us ill founded. We think it is incorrect because it not only assumes 
that there are different kinds of general intelligence, but because it 
further implies that the Performance tests are relatively unimportant 
as measures of general intelligence. Our Scale assumes the contrary. 
It was constructed on the hypothesis that an individual manifests in- 
telligence by his ability to do things, as well as by the way he can 
talk about them. 

The second method of treating discrepancies between Verbal and 
Performance I.Q.’s, namely, that of taking the higher of the two I.Q.’s 
as the true measure of an individuaTs intelligence, is likewise unsatis- 
factory. The main argument in its favor is, that by accepting the 
highest rating which a person makes, we are really getting what is most 
important, namely, a measure of his maximal functioning ability. Our 
own view that tests ought to be fitted to the subject would seem to favor 
this attitude. Also the fact that clinical experience often justifies the 
choice of the higher rather than the lower I.Q. Nevertheless, we are 
opposed to this selective procedure because we do not believe that any 
test which is either predominately Verbal or Non-verbal can alone 
completely measure an individuals general intelligence. 

The third way of meeting the problem of discrepancies between 
Verbal and Performance I.Q.’s is to assume that neither is correct and 
that the true I.Q. of any individual with a discrepancy of this sort will 

* The mean Performance and Verbal I.Q.’s for the entire population were 100.75 
and 100.11, respectively. 



138 


MEASUREMENT OF ADULT INTELLIGENCE 


generally lie at some intermediate point. The most probable value of 
this intermediate I.Q. will in most cases be obtained by averaging his 
different I.Q.’s. The Full-Scale Bellevue LQ.’s are such averages. 
This results from the fact that they have been obtained from test scores 
to which both performance and verbal items have been permitted to 
contribute. As now constituted, the Full-Scale gives equal weight to 
performance and verbal abilities, since it is made up of five performance 
and five verbal tests. It is possible that some other combination, say 
one made up of six verbal and four performance tests would constitute 
a better scale, but whatever the final proportion between the two, the 
important consequence of our method of combining them is that the 
subject’s LQ. is always an average I.Q. 

While insisting that intelligence ratings should be based on an aver- 
aged composite of Performance and Verbal test-scores, we should not go 
so far as to say that there may not be instances where such procedure 
would be unjustified. It would clearly be unfair to use a combined 
Verbal and Performance scale on subjects suffering from motor or visual 
defects or some special language handicap. One must also admit that 
from time to time one does encounter individuals, corresponding to what 
in the literature are referred to as persons ‘Svith special disabilities’’ who 
need special consideration. With such subjects it is often better to use 
one rather than another type of intelligence scale. But even in such in- 
stances, our own experience has shown that their ‘True” rating is never 
as high as that indicated by the results obtained with a type of test that 
favors them, just as it is never as low as that indicated by the tests 
which penalizes them. 

In connection with the problem of discrepancies between I.Q.’s 
obtained from the Verbal and Performance part of the Bellevue, it 
should be noted that except for the I.Q.’s near 100, the LQ. of a subject 
on the Full-Scale will seldom be equal to the exact average of his Verbal 
and Performance I.Q.’s, calculated separately. There will generally be 
a difference between them of one or two points, which will increase, as 
the I.Q.’s become smaller. Thus for mean I.Q.’s of about 60, the 
difference between the I.Q. of the Full-Scale and the LQ. obtained 
averaging the Verbal and Performance parts may amount to as much as 
4 or 5 points. Conversely, the Full-Scale I.Q. above 100 will tend to 
be a little higher than either Verbal or Performance I.Q. These dis- 
crepancies result from the fact that the LQ. distribution curves for the 
Full, Performance, and Verbal scales differ somewhat in form. The 
discrepancies could be allowed for by establishing separate classifies- 
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tions for each of the scales, but the differences are generally so small 
that it did not seem worth while. 

Another criticism of our Bellevue Scale made by some of the ex- 
aminers was that the I.Q.’s obtained with it on older subjects were too 
high. By ^‘too high” the examiners meant either that the Bellevue 
I.Q.’s were higher than those obtained with other scales, or that the 
individuals tested appeared to be of lower intelligence than the Bellevue 
I.Q.’s indicated. We have already pointed out why our adult I.Q.’s, 
particularly those for subjects 40 years and over, will be sj^stematically 
higher than those obtained by present standard scales. Older subjects 
require a lesser absolute score to obtain the same I.Q. Thus, to obtain 
an I. Q. of 100 at age 25 a subject has to attain a test score of 95; at 
year 35, a score of 87; at 3 "ear 45, a score of 81; and so on. But even 
when conceding the theoretical validity of the procedure, examiners 
found it hard to accept it in practice. Much as some of them tried, 
they could not convince themselves that a subject’s age really influenced 
the I.Q. to the extent posited by the Bellevue Scale. This feeling con- 
tinued primarily because they had been accustomed so long to obtaining 
I.Q. ratings with tests in which the age factor had been neglected. The 
Bellevue I.Q.’s seemed too high because in point of fact the Binet and 
other comparable I.Q.’s had always been too low. A similar explana- 
tion accounted for some of the impressions of examiners that the sub- 
ject’s intelligence was lower than the rating assigned to it by our scale. 
Here the difficulty was that most of the examiners attempted to inter- 
pret the test performance of the subject in terms of functioning levels 
primarily associated with children’s M.A. scores. Using children’s test 
scores, however, against which to match adult performance, is, as we 
know, quite invalid. A mental age of 12 years by a child of 12, may 
represent quite a different order of intelligence from what it does when 
attained by a ttim of 40. An examiner who has had little experience 
in testing adults really does not know what order of intelligence the 
scores made by them represent. In general he is more likely to be mis- 
led than helped by making use of criteria gained from the testing of 
children. 

The problem of differences between I.Q.’s obtained on the Binet and 
other scales will necessarily prove an important consideration of those 
who contemplate using our tests. We have devoted considerable time 
to investigating the problem from various angles. Our results show that 
when such discrepancies occur, most of our clinical, social and vocational 
data confirms the Bellevue rather than the discrepant I.Q. This is 
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particularly true of older individuals who have made low I.Q. ratings 
on the Stanford-Binet and considerably higher ones on the Bellevue 
Tests. Many of these are cases of subjects who if rated on their I.Q.’s 
attained on the Binet or similar tests would have to be classified as 
mentally defective, yet who could in no wise be so designated from a 
clinical point of view.^ In such cases it was particularly gratifying to 
find that the Bellevue norms gave them LQ.’s which were in accord 
with the general clinical impression (e.g., I.Q.'s more like 75 or 80 in- 
stead of 60 or 65). And the same might be said of many ratings which 
were above the defective level on the Binet but which again were in- 
creased by 10 or 15 points when the subject was tested with the Bellevue. 
The great advantage of the Bellevue Scale is that one is not nearly so 
often confronted with the problem one has when testing adults with 
other scales, of accounting for or explaining discrepancies between one’s 
quantitative results and clinical impressions. The two generally agree. 

We have thus far emphasized instances where the Bellevue I.Q. was 
higher than that of the I.Q. of the compared test. But we also had 
instances where the reverse was true, although these were generally 
rarer. Here again, however, clinical criteria, in the main, supported 
the Bellevue ratings. But we do not wish to convey the view that this 
was always the case. There were many instances where the Binet or 
the Army Alpha I.Q.’s did more nearly represent the subject’s intelli- 
gence level as measured by clinical criteria. That, of course, was to be 
expected, and does not refute our main contention. No test can be so 
uniformly free from limitations but that another may not be a better 
measure for particular cases. Every experienced exam i ner will have 
found that for certain types of individuals® some tests are more suitable 

* See for examples cases D. W. and M. B., p. 130 and p. 132. 

s This is particularly true of mental patients where a difference of 20, or even 
30 points in I.Q. between ratings obtained with different tests is not rare. Cer- 
tain types of schizophrenic patients, for example, will do extremely well on tests 
like the Army Alpha and do very badly on either the Bellevue or the Binet Scales. 
This is seemingly due to the fact that the Army Alpha is made up of what we may 
call 'perseverative' tests. Many of the tests, e.g. the Opposites test, merely 
require the subject to repeat the same task over and over again. All he has to do 
to answer a question is either to cross out or underline words or numbers. That 
requires very much less adaptation or directed effort than responding verbally to 
unrelated questions following one upon another without any special continuity, 
as is required by most individual types of examinations. On the other hand, there 
are subjects who require continual stimulation and encouragement and who when 
left to themselves go altogether to pieces. With such subjects one gets far better 
results by using a personal type of examination like the Binet or Bellevue Scales. 
There are, however, many exceptions to the rule. 
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than others. In such instances it is up to the examiner to use what he 
thinks is* the best instrument. The test must at all times be adapted 
to the subject. 

Another point to which we have given much consideration is the 
desirability of using the term I.Q. to designate the indices which we 
have offered as measures of brightness. A number of psychologists 
with whom we discussed the problem thought that we ought not use 
the term, and curiously enough for two very different reasons. One 
group argued that the term I.Q. has been so long associated with the 
Binet Tests that to call our index by the same name would only lead 
to confusion, particularly in view of the fact that it is calculated in an 
altogether different manner. Another group argued that the I.Q., how- 
ever calculated, had proved itself so inadequate and had so many draw- 
backs that it did not merit continuance. One psychologist went so 
far as to insist that the I.Q. had already ‘^destroyed itself’\ We have 
not felt, however, that the arguments of either group are sufficiently 
cogent. In the first place, we may note that while the term I.Q. has 
been most frequently associated with the Binet type of test, neither the 
original definition nor present applications of it make it contingent upon 
the use of any particular kind of test. Actually the term I.Q. has been 
used to calculate indices of a large number of tests other than the Binet. 
The objections to this application derive not from the way the I.Q. is 
calculated, but from the altered meaning it acquires when applied to 
different kinds of tests. The essential meaning of the I.Q. is that it is 
an index of brightness obtained by comparing an intelligence test score 
of a subject with the average score of individuals of his own age. It is 
precisely to this essential meaning of the term that the Bellevue I.Q, 
conforms, only instead of stopping at year 15 or 16, to which the M.A. 
over the C.A. method of calculating restricts it, we have made possible 
its extension to ages above 16. 

As regards the second argument that the I.Q. is such an unreliable 
measure that it is practically worthless, we must again take exception. 
It is true that the I.Q. as hitherto calculated neither remains constant 
nor equivalent throughout an individual’s life span. In trying to make 
it so, Heinis and others have abandoned the term, and in recent years 
Kuhlmann has come out in favor of the Per cent-of-Average ratio. We 
think, however, that the term I.Q. is too happy a one to be abandoned. 
With a little effort it can be freed of most of the objections which have 
been raised against it. The method of calculatiug it offered in this 
book does precisely this. We think it is not too much to ask, that in 
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view of the practical importance of the term, the old method of calcu- 
lating it be so modified as to eliminate its shortcomings.^ 

A final word as to the numerical differences between I.Q.’s obtained 
by the method outlined in this book and those furnished by the M.A. 
over the C.A. method. In particular we wish to indicate the order of 
I.Q. difference that may be expected between the Bellevue and Binet 
Scales with which they are likely to be compared. 

In general, adult I.Q.’s obtained on the Binet will differ from those 
obtained on the Bellevue Scale, in proportion to the degree to which 
the Binet fails to make allowance for the age factor. Accordingly, I.Q.’s 
obtained on the Bellevue Scale® will show close agreement with those 
obtained on the various revisions of the Binet up to about age 15, be- 
cause up to this age they both take into consideration the age factor. 
But from this age on the Bellevue I.Q. will begin to be systematically 
higher than the Binet I.Q. Between ages 15 to 30 the differences will 
be small. Beyond age 30 Bellevue I.Q.’s will grow larger and increase 
with the increasing age of the subject, so that when age 50 is reached 
they may be as much as 20 points higher than the Binet I.Q. 

That the differences between adult I.Q.’s obtained with the Bellevue 
Scale and those obtained with other scales are not due to differences in 
the level of test performance, but rather to the methods by which these 
levels are interpreted, may be shoum by recalculating I.Q.’s obtained 
in the latter scales, in a manner that will make allowance for the age 
factor. When this is done, I.Q.’s calculated on a standard test like 
the Stanford-Binet or Army Alpha will be foimd to approximate very 
closely the I.Q.’s obtained on the same subject with the Bellevue Scale. 
As many examiners using other scales might want to know what effect 
allowance for the age factor would have on their test results, we have 
prepared a table which will enable them to translate Binet into Belle- 
vue I.Q.'s. Table 45® shows what CA. denominators have to be as- 
sumed when calculating I.Q.^s for adults at different ages by the 

method when the age factor is allow^ed for. The table holds primarily 
for various revisions of the Binet Scale, but may be used on other tests 
which have been standardized against them. Roughly summarized, 

* Tennan and Merrill have already begun to do this by using a sliding denomi- 
nator when calculating I.Q/s for ages 13 to 16. All that would be necessary to 
make their method of calculating I.Q/s applicable to adults would be to furnish 
proper denominators for ages above 16 in the same way. 

® More specifically, the Verbal part of the Scale. 

® See appendix 5. 
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our analysis showed that if 15 years is taken as the M.A. of the average 
adult, the mean I.Q. for persons between ages 15 and 30 will remain 
fairly constant. Continued beyond this age the use of a 15 year de- 
nominator results in a systematic drop in the I.Q. of about two-thirds 
of a point per year. This drop can best be compensated for by the use 
of an appropriate regression equation."^ But sufficient accuracy is 
obtained by diminishing the assumed denominator by 6 months for 
every 5 years accretion in chronological age. 

While the correction for age is probably the main cause for the differ- 
ences in adult I.Q.’s obtained on the Bellevue and the Binet Scales, it is, 
of course, not the only factor. Other factors which contribute are: 
differences in the content of the respective scales; differences in th"^* 
statistical procedure““ and, differences in the original populations used 
in the standardization. The Bellevue scale contains a considerably 
greater proportion of performance to verbal material than does the 
Stanford-Binet scale, so that the two scales do not quite measure the 
same thing or, if the same thing, not in the same way. We have already 
discussed the general argument for combining performance with verbal, 
material, but regardless of the theoretic validity of this procedure the 
practical effect is to make for differences of achievement when individ- 
uals are tested with both scales. This is made manifest by differences 
in order of correlation between the Bellevue Verbal and Bellevue Per- 
formance with different forms of the Binet. Reported studies (see p. 134) 
show consistently high correlations (.60 to .85) between Binet and Belle- 
vue Verbal I.Q.s, but only low or moderate ones between Binet and 
Bellevue Performance I.Q.s. In the latter respect, the results are no 
different from correlations obtained between Binet and other perform- 
ance tests, but the fact that the Verbal and Performance tests correlate 
least at college level shows that the level as well as the range of intelli- 
gence of the groups compared must be taken into account in the inter- 
pretation of results. 

The influence of the particular statistical procedures used in the stand- 
ardization of the Bellevue scales is less patent, but one seeming effect 
has been a reduction in the range of I.Q.’s obtainable. The scale doe^ 
not have either as high a “ceffi ng^^ or as low ^“floor’" as some other 
scales (e.g., the 1937 Stanford-Binet). This means that it is not pos/ 
sible to obtain as low or as high I.Q.’s on the Bellevue as on these other 

^The regression of score on age for the Bellevue Scale is y = .735 x -f S5.77, 
where y = mean score at any age and x = the difference between 40 and the 
chronological age at which the score is desired. The equation holds only for 
ages 25 and over. 
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scales. The constriction of range may be considered a limitation or a 
merit of the Bellevue depending upon how one interprets extreme meas- 
ures, but in any event, brings up the question of the diagnostic signif- 
icance of the I.Q.’s. Obviously, when I.Q.’s of scales having different 
ranges are compared, the percent of population falling within certain 
I.Q. limits vdll not be identical. And, conversely, individuals obtaining 
identical LQ.’s on the different scales may, in terms of percent of rating, 
represent different levels of intelligence. 

Differences in the diagnostic significance of I.Q.^s obtained on different 
scales being primarily a function of their respective deviations, could 
theoretically be avoided if the test scores of the compared scales were 
given (as it has often been urged) in terms of standard deviation units. 
In the absence of this practice, valid comparison of LQ.’s requires their 
equation to some common base, and, as a provisional attempt in this 
direction, we have drawn a table of equivalence between the full Belle- 
vue, the “old"’ 1916 and the “new” 1937 revisions of the Stanford-Binet. 
The results are shown in table 46, appendix 6. Examination of the 
table shows that within the overlap of ages compared (ages 8-18) the 
I.Q, limits for the P.E. and percentile equivalents are approximately the 
same (i.e. within 5 I.Q. points) on all three scales, for I.Q.’s falling below 
the mean; that they are likewise very close between the 1916 Stanford 
for I.Q.’s above 100; but that the limiting I.Q.’s for both the Bellevue 
and the 1916 Stanford-Binet are significantly lower than those for the 
1937 revision for I.Q.’s above 100, This last difference is particularly 
marked at the superior level of intelligence amounting to an average of 
10 I.Q, points at deviations of 2 to 3 P.E. above the mean. It might 
be further noted here that comparative studies support these theoretical 
expectancies. 

We have already noted the problem of the low ceiling imputed to the 
Wechsler-Bellevue Scale, Our o^vn opinion is that this reduced ceiling 
^ compared to other tests is not an accident of the construction of the 
scale, but represents a factual^ilpation, namely that it is not possible to 
measure general intelligence beyond a certain level. I.Q.’s above 130, 
whether on the BeUevue or any other tests, do not represent superiority 
in intelligence level, but special aptitude in intellectual ability. A 
measure of such ability, of course, is desirable since it correlates well 
with academic educability and degree of scholastic success. To meet 
this need, we have extended the range of the Verbal part of the scale at 
the upper levels and by extending the maximum score possible on them 
from 17 to 18. These changes will not affect the scores of most of the 
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population in any significant way, but should nevertheless furnish 
better discriminating scores for I.Q.’s above 120. 

Another question regarding which we have had recurrent inquiry 
is the feasibility of an abbreviated scale. The scale may be shortened 
to include as few as five subtests, without seriously altering its reliability 
provided a judicious choice of subtests is made. The reduction is pos- 
sible because of the relatively high correlations between most of the 
subtests with total scale score (see table 43). Of course the more tests 
included the higher will be the correlation of the battery with the Full 
scale, but not necessarily so. Thus, the correlation between seven of the 
tests (all except Object Assembly, Picture Arrangement and Digit Sym- 
bol) and the Full scale is above .90. But the correlation of the Verbal 
part alone with the Full scale is also around .90, and its correlation with 
other scales is generally just as high as that of the Full scale, and some- 
times even higher. 

Although it is possible to derive valid short forms of the Wechsler- 
Bellevue, the legitimacy of emplo 3 ing them will, apart from reasons of 
practical necessity, depend on the objectives of the examiner. Where 
the main aim is to obtain a reliable intelligence rating quickly, an abbre- 
viated form of the scale will suflBlce in most cases. For such purposes, 
the simplest and safest procedure is to use the Verbal part of the exam- 
ination alone, and rate the subject on the basis of I.Q.’s furnished for 
this part of the scale. If a combination of Verbal and Performance tests 
is desired, this may also be done, provided the subtests chosen are from 
among those showing highest correlations with total score and the bat- 
tery consists of at least six tests. In such cases the sum of the subject’s 
weighted score should first be prorated to a base of ten tests and his 
rating obtained by looking up this score in the Full Scale I.Q. tables. 
If, however, the examiner desires something more than a mere I.Q., 
omission of any one or more tests will reduce the diagnostic value of the 
scale by that much. It is often the tests which correlate least with 
global I.Q., that contribute most to the clinical picture. In any event, 
reduction in the number of the tests used not only restricts the areas 
in which one can observe the subject function, but makes attempt at 
pattern analysis extremely difiB.cult, if not impossible. 



CHAPTER 11 

DIAGNOSTIC AND CLINICAL FEATURES 

Although the primary purpose of an intelligence examination is to give 
a valid and reliable measure of the subject’s global intellectual capacity, 
it is reasonable to expect that any well conceived intelligence scale will 
furnish its user with something more than an I.Q. or M.A. In point of 
fact, most intelligence examinations, when administered individually, 
make available a certain amount of data regarding the testee’s mode of 
reaction, his special abilities or disabilities and, not infrequently, some 
indications of his personality traits. At present, the amount of this 
sort of adjuvant data which may be derived from an intelligence ex- 
amination is in a large measure dependent upon the individual examin- 
er’s clinical experience and sagacity. Ko doubt this will always remain 
true to a greater or lesser degree. But much also depends upon the 
intrinsic merits and diagnostic possibilities of the tests themselves. 
The following remarks are accordingly directed to a discussion of those 
features of the Bellevue scale which either preliminary analysis or subse- 
quent experience have shown to be of clinical and diagnostic value. 

The most obviously useful feature of the Wechsler-Bellevue scales is 
their division into a Verbal and Performance part. We have already 
discussed the general significance of ^his division. Its h priori value is 
that it makes possible a comparison between a subject’s facility in using 
words and symbols and his ability to manipulate objects, and to per- 
ceive visual patterns. In practice this division is substantiated by 
differences between posited abilities and various occupational aptitudes. 
Clerical workers and teachers, in general, do much better on verbal tests, 
whereas manual workers and mechanics do better on performance. 
The correlations are sufficiently high to be of value in vocational guid- 
ance, particularly with adolescents of high school age. 

Apart from their possible relation to vocational aptitudes, differences 
between verbal and performance test scores, particularly when large, 
have a special interest for the clinician because such discrepancies are 
frequently associated with certain types of mental pathology. When- 
ever a mental disorder produces a change in the individual’s functioning 
capacity, the resultant loss is generally not uniform, but affects certain 
abilities more than others. This fact is frequently made use of in a 

146 



DIAaiSrOSTIC and clinical featuees 


147 


crude way in psychiatry and neurology where specific disturbances or 
defects are considered pathognomic symptoms of various disease en- 
tities. Familiar examples are t!^ disturbances of association (flighti- 
ness of ideas) in manic depressive insanity and the memory defects in 
chronic alcoholism of the Xorsakoff type. In so far as the diagnostic 
significance of large differences between verbal and performance ability 
as a whole is concerned, the general finding is that in most mental dis- 
orders impairment of functioning is greater in the performance than in 
the verbal sphere. This holds for psychoses of every type, organic 
brain disease, and to a lesser though still large degree, in most psycho- 
neuroses. On the other side of the fence there are only two groups. 
One is the adolescent psychopath (without psychosis) and the other the 
high grade mental defective. Both of these do better on performance 
than on the verbal tests. It is interesting to note that both psycho- 
paths and mental defectives differ from other psychopathic states in 
that they represent failure of functioning due to a flack of — ’ rather than 
a disturbance or disorganization of functioning ability.^ 

In appraising differences between verbal and performance test scores 
one must naturally allow for variability even among normal individuals. 
For subjects with I.Q.’s not far from the average, a variation of 8 to 10 
points between verbal and performance in either direction is within the 
normal range. But the amount as well as the direction of the differ- 
ences also varies with the age and intelligence level of the individual. 
Subjecte^o^Tjperior intelligence generally do better on the verbal, and 
subjects of inferior intelligence better on the performance part of the 
examination. There are also racial (group) and cultural differences. 
We have made no systematic study of this aspect of the problem but 
experience shows that the psychometric patterns of colored subjects for 
example, need special interpretation. Our general observations also 
confirm the findings of older studies which show that Jewish children 
do better on verbal and Italian children on performance tests. Among 
adults, the nature of the subject’s occupation is frequently an important 
factor, so that carpenters, as might be expected, will generally score 
higher on performance and lawyers, on verbal tests. All this means, of 
course, that a significant difference between a subject’s verbal and per- 
formance score cannot be interpreted carte blanche but only after due 

1 Appreciation of this flack' helps us to understand why psychiatrists of the 
older generation used to refer to certain classes of psychopaths as “moral imbe- 
ciles”. This appraisal of the psychopath agrees with Kahn’s definition of psycho- 
pathic personalities as individuals characterized by quantitative divergencies in 
impulse, temperament . . . etc. 
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weight has been given to the various factors which may have contributed 
to it. With these limitations in mind, the data given in table 30 may 
be accepted as clinically valid. 

The second clinically useful feature of the Bellevue scale, is that the 
different mental abilities tested by it, may be compared at all levels of 
functioning. This is achieved through the fact that the same type of 
material is used throughout the scale, and because the individual sub- 
tests of the scale have been equated. This makes it possible to compare 
a subject’s individual test scores and to look for significant test patterns. 
For any pattern analysis it is necessary to know the subtest score for 
any given total score, and the variations in these scores which occur 
with age. Since each of the subtests have been equated, the mean 
expected score on any given subtest for any given total score may be 
obtained with good approximation by dividiug the total score by 10. 

TABLE 29 

Clinical Groups Generally Scoring Higher on Verbal Tests 
Organic brain disease 
Psychoses 
Psychoneuroses 

Clinical Groups Generally Scoring Higher on Performance Tests 
Psychopaths (adolescent) 

Mental defectives 


Thus, if a subject makes a total score of 95, the mean expectancy for any 
given subtest is 9.5; or, since the performance and verbal scores do not 
contribute identically equal amounts to the total score, a somewhat 
better approximation is obtained by dividing the sum of the verbal 
tests and the sum of the performance tests by 5 respectively. Thus, 
for a subject making a total score^ of 95 with verbal and performance 
scores of 50 and 45 respectively, the mean expected scores for verbal 
tests would be 10 and for performance tests 9 points each. For the 
variations in subtest scores with age, separate data are, of course, neces- 
sary, and these data have been furnished in Tables 39 and 40, page 222. 

The problem next arises as to what constitutes a significantly varying 
score. This necessarily has to be established both through clinical and 
statistical vahdation, and here again, not only for any given score but 
for all possible combinations of scores. While the data is available for 
such complex analysis, the multiple tables necessary for it have not 

* These are, of course, the weighted scores. 
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been made, but again may be bad vnth sufficient approximations, by 
the following rule of thumb: For total scores on the full scale lying 
within the limits 80 to 110,® a difference of more than two points from the 
mean subtest score is significant. Thus, a subject making a total score 
of 95 with the following distribution of subtests: 


Comprehension 

Arithmetic 

Information 

Digit-span 

Similarities 

Picture Arrangement 
Picture Completion . , 

Blocks 

Object Assembly. . . . 
Digit Symbol 


11 

9 



13 X 
9 

6 S 


11 ^ 
10 
9 


would manifest significant variations or differences in the Similarities, 
Picture Completion and Digit-span tests. 

For subjects having total scores beyond the limits 80-110, the devia- 
tion of the individuaTs subtests from their mean, defining a significant 
difference may be roughly obtained by dividing the mean subtest score 
by 4. Example: Subject A makes a full score of 56. The mean sub- 
test score is accordingly 5.6 points; one fourth of this is 1.4 points. 
Hence, any of his individual subtest scores which deviate by more than 
1.5 points from this mean would be significant%fl^gain, subject B 
obtains a full score of 132; mean subtest score is 13.12. Hence, for sub- 
ject B any individual subscores would have to deviate by 3 points 
(actually 3.3 points) from the mean of the subtests in order to be 
significant. 

T he amount by '^which a ^ven subtest must differ from the mean of 
the various subtests in order to be significant, is roughly proportional to 
the magnitude of his total score.*^ In certain instances where the dis- 
crepancy between performance and verbal is very large, it is desirable 


* This represents a deviation of approximately dbl S.D. from the mean of 95 
at ages 20 to 35. 

* This may be assumed if the correlation between variability and ability is 
small. Such seemingly is the case at least between the ages 20 and 40. Though 
the r^s between them have not been calculated, a low correlation between ability 


(total score) and variability 


(i) 


would be in accord with the findings of most 


other investigators. Cf. Bown, M.D., Variability as a Function of Ability, etc. 
Doctor's Thesis, Columbia Univ., 1941. 
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and often necessary to treat each part of the examination separately. 
This is particularly true in certain organic cases which show large dis- 
crepancies between verbal and performance as a whole but relatively 


TABLE 30 

Test characteristics oj various clinical groups 


A. Organic Brain Disease 

Information 

+ 


Comprehension 


except in paretics 

Arithmetic 

-- 


Digit Span 

— 

particularly digits backwards 

Similarities 



Vocabulary 

+ + 


Piet. Arrangement 

0 


Piet. Completion 

0 


Object Assembly 

0 to 

depending on type of impairment 

Block Design 

to 0 

depending on type of impairment 

Digit-Symbol 

— 


Verbal higher than Performance 


Intertest Variability: Omitting the 2 or 3 tests on which subject is likely to 

do very badly, scatter of remainder generally small. 

B. Schizophrenia 

1 Information 

4“ to + 4" 


Comprehension 

4“ to — 

depending on type of Schizophrenia 

— Arithmetic 

0 to - 


Digit Span 

0 to 4- 


-- Similarities 

4" to — — 

depending on type of Schizophrenia 

Vocabulary 

4- 4- 


Piet. Completion 

0 to 

depending on type of Schizophrenia 

Piet. Arrangement 

— to 0 


Object Assembly 

— 


^ Block Design 

0 to 4-’ 


Digit-Symbol 

- 



Verbal generally higher than Performance 

Sum of Piet. Arrangement plus Comprehension less than Information and 
Block Design. 

Object Assembly much below Block Design. 

Very low Similarities with high Vocab. and Inf., deiSnitely pathognomic. 

Intertest Variability marked; and in most cases greater between sub tests of 
the Verbal than of the Performance part of the Scale. 
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C. Neurotics 


Information 

4- 

Comprehension 

+ 

Arithmetic 

0 to — 

Digit Span 

— but tmpredictable 

Similarities 

+ 

Vocabulary 

+ 

Piet. Completion 

0 

Piet. Arrangement 

— 

Object Assembly 

— 

Block Design 

0 

Digit-Symbol 

— 

Verbal generally higher than Performance, 

Sum of Piet. Completion plus Block Design generally greater than Sum of 
Piet. Arrangement and Object Assembly- 

Intertest Variability: 

moderate; less than in psychoses but greater than in 

normals or psychopaths. 

D. Psychopaths (Adolescent) 

Information 

— to 

Comprehension 

0 to — 

Arithmetic 

— 

Digit Span 

0 to — 

Similarities 

— to 0 

Vocabulary 

0 

Piet. Completion 

-b to 0 

Piet. Arrangement 

-b-f- to -f* 

Object Assembly 

+ -b to -b 

Block Design 

-b to 0 

Digit -Symbol 

0 to — 

Performance generally higher than Verbal. 

Sum of Object Assembly plus Piet. Arrangement nearly always greater than 
the Sum of Block Design and Piet. Completion test scores. 

Intertest Variability: 

: comparatively limited 

E. Mental Defectives 


Information 

0 to — 

Comprehension 


Arithmetic 

— 

Digit Span 

— to 0 

Similarities 

0 

Vocabulary 

-b 4- 

Piet. Completion 

— to 0 

Piet. Arrangement 

0 

Object Assembly 

4" 

Block Design 

0 to “b 

Digit-Symbol 

— to 4“ 

Performance generally higher than Verbal. 

Intertest Variability: 

limited. 

While Similarities and Block Design are generally at mean and occasionally 

above mean, high scores on either are counterindicative of Mental De- 

ficiency. 
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little variation between the subtests constituting the two parts. Case 
0-1 given below is an example. 

Having determined how much of a test score difference constitutes 
a significant deviation, the general clinical problem of test patterning 
consists of establishing associations between particular test score di- 
vergencies, and specific clinical entities. This may be done in several 
ways, but the method which the author has found most adaptable for 
clinical use is that of “counting"’ or integrating signs. A “sign"" or 
symptom is a low test score which has been found to be characteristic 
of, or associated with, a particular type of mental disorder or disfunc- 
tion. Thus, if a low Performance and a particularly low Block Design 
test score has been found characteristic of organic brain disease cases, 
then a subject who does badly on both these items will be assumed to 
show two “signs"" of possible organic involvement. This, of course, 
does not mean that it is necessarily an organic case, because there are 
other conditions which also show these “signs’". But the first step 
always consists of noting the number of instances* of relatively low or 
contrastingly high scores in the subject’s test score distribution. 

The above procedure, of course, implies a prior demonstration of the 
existence of established associations or correlations between particular 
test ^‘signs’" and disease entities. The extensive statistical work re- 
quired for such correlations has only barely begun, but we have accumu- 
lated sufficient clinical experience with the Bellevue Scale to warrant a 
presentation of certain empirical findings which we believe will be of use 
to the clinician in his attempts at diagnosis. These have been put 
together in Table 30 which attempts to summarize the test patterning 
met with in various common mental disease entities. By test patterning 
here, is merely meant a summary of the various tests on which patients 
suffering from various mental disorders have been found to do particu- 
larly poorly or have shown significantly contrasting score differences. 

The meaning of the symbols used in table 30 are as follows: + and 
+ + signifies relatively good, high or considerably above the mean of 
the subject’s remaining subtest scores; — and — relatively poor, 
low or considerably below the mean of the subject’s remaining subtest 
scores; 0, no significant deviation from the mean of the remaining subtest 
scores. The combined symbols + to 0, — to 0, etc., signify that the 
deviation in subtest scores may be either above or approximately at the 
mean of the remaining subtests, etc. In general, the symbol placed 
first represents the general tendency. Thus, Object Assembly under 
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Organic brain disease is marked 0 to . This means that, in general, 

the Organic subject’s score on Object Assembly is not outstanding but 
that in some cases it may be extremely low. 

On a quantitative basis the symbols have approximately the following 
significance: 

+ a deviation of from 1.5 to 2.5 units above the mean subtest score 
+ + a deviation of 3 or more units above the mean subtest score 
— a deviation of from 1.5 to 2.5 units below the mean subtest score 

a deviation of 3 or more units below the mean subtest score 

0 a deviation of 4-1.5 to —1.5 units from the mean subtest score. 

All deviations are in terms of weighted score units. 

COMMENTS 

1. Organic brain diseases- The category organic brain diseases covers 
a large group of S 3 uidromes ranging all the way from brain tumors to 
chronic alcoholism. The cases cited are not intended as examples of 
differential diagnoses between any specific disease entities but rather as 
illustrations of the disorganization erf thejntellec^^^ observed 

in most organic brain cases irrespective of type. The most general 
S 3 unptoms of organic brain cases are disturbances in the visual motor 
spheres, a loss of shift, various memory defects and a falling off of 
capacities involving mjaniffilonj a^ synth etic ability. * 

Organic brain cases, with few exceptions, do consistently better on 
verbal than on performance tests. The greatest and most consistent 
falling off is on the Digit Symbol tests; but even more diagnostic, though 
not necessarily the most adversely affected performance, is the organic’s 
inability to do the Block Design test, which is systematically associated 
with disturbances in visual motor organization. Next in order of fre- 
quency to a low score on the Digit Symbol and the Block Design are 
low scores on the Arithmetic, and depending on the type of organic 
involvement, the Object Assembly test. Memory defect is reflected in 
low memory span score, particularly on digits backwards. Certain 
cases also do badly on the Similarities test and this may reflect either a 
loss in conceptual thinking or, more frequently, an increasing rigidity 
in thought processes. All new learning is markedly affecte d. This 
accounts mainly for the organic’s iqw score on the Digit Symbol test, 
although visual motor disturbances alsq.play a role in it. 

Some groups of cases, like the paretics and arteriosclerotics, often 
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show a generalized deterioration, that is, do badly on nearly all^ tests. 
In general, they do better on verbal than on performance, but show no 
great variability as regards the tests which constitute each part of the 
Scale. This fact is often of value in differential diagnosis, for example, 
as between traumatic brain injury where the defects are uneven, and 
general paresis where the intellectual processes are more or less equally 
impaired. 

2. Schizophrenia. Schizophrenia or dementia-praecox comprises a 
related^^Qgg of a^^tions rather than a singl e disease en tity. Although 
the classical division of Seiheiittal^fag^ox info"TEe”frar tj^pes, catatonic, 
paranoid, hebephrenic and simple, is more theoretical than factual, it 
does suggest what experience demonstrates, that schizophrenics may 
vary widely among themselves both as to s 3 nnptomatology and general 
picture. Moreover, the general diagnosis of schizophrenia itself is 
often contingent upon the orientation of the psychiatrist or the school to 
which he may belong. A case which in one hospital is diagnosed as 
schizophrenia may, very well at another, be labelled manic depressive, 
and vice-versa. In view of this situation it is obvious that no single 
list of signs can be either suflSciently comprehensive or free from ex- 
ceptions if not contradictions. 

Intellectually, the most general effect of the schizophrenic process is 
the impairment of the subject’s mental efficiency. This loss is evidenced 
by the low scores which he makes on most tests calling for immedia te 
and directed effort. Occupationally, this is shown by the fact that the 
schizophrenic’s vocational adjustment is often considerably inferior to 
what might be expected of a person with his original endowment. Sup- 
plementing the impairment in mental efficiency, the schizophrenic is 
further characterized by a marked slowing up of his thinking^ a los^m 
mental shift and a tendency toward perseveration. As often noted, he 
doeT^ffiuch^ttef 'bh^verbal This relative superiority of the 

schizophrenic is of special interest because ordinarily one does not think 
of him as a verbalist. Clinically he is continually being referred to as a 
shut-in, uncommunicative person. The inconsistency, however, between 
the two findings is explained when it is discovered that the verbal tests 
on which the schizophrenic does well are precisely those which do not ^ 
reauirejmuch^jve rbalization ^ When they do make this demand, as in 
the case of the Similarities test, he is likely to fail. This failure is not 
due to a lack of either understanding or linguistic facility, but to a dis- 
tortion in the patient’s ideational processes. The schizophrenic mis- 
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interprets words just as he misinterprets reality, and his incongruent 
replies, like his bizarre ideas, are a product of this misinterpretation. 

Another characteristic of the schizophrenic is his inability to deal with 
concrete and specific situations. He is oblivious to details and does not 
percieve ordinary likenesses and differences, difficulties which are often 
reflected by the poor scores he attains on either the Similarities or Pic- 
ture Completion tests or both. Last but not least is the schizophrenic's 
unpredictability, so that now and again one finds patients who do well 
on one or several of the tests, failure on which we have listed as charac- 
teristic of schizophrenics. A thorough acquaintance with a particular 
case generally makes it possible to account for the contradictory find- 
ings, but the occurrence of such exceptions shows that the diagnosis of 
schizophrenic through psychometric '‘signs” or "patterns” is not a cut 
and dried affair. The cases we have used as illustrations are typical 
only in the sense that they are of individuals who manifest most or many 
of the signs characteristic of the schizophrenic group as a whole. 

' 3. Psychopaths. The most outstanding single feature of the adoles- 
cent psychopath^s test pattern is his systematic high performance score 
as compared with Ms verbal test score. Occasional exceptions occur 
but these generally reflect some ‘ special ability or disability. Also 
worthy of note is the good score frequently made by the psychopath on 
the Pictum ^ Arrangement test, a finding that is surprising because this 
test has been interpreted as measuring social intelligence. If this in- 
terpretation is correct, a distinction must be made between understand- 
ing and resultant behaviour. Psychopaths generally have a grasp of 
social situations, but they are inclined to manipulate them to their own 
advantage in an anti-social way. The point cannot however be pushed 
too far because there are many exceptions to the rule, particularly in 
the case of extreme psychopaths who are not only perverse in their 
behavior, but distorted in their social comprehension. The psycho- 
path's test performance as a whole is characterized by a breeziness and 
self-assurance wMch contrasts markedly to that of a neurotic. He is 
not bothered by contradictions and, when not ornery, takes everything 
in Ms stride. His abstract thinking is generally below average and this 
is frequently indicated by a low score on the Similarities test. He also 
tends to do poorly on Arithmetical reasoning; but in tMs connection it 
must be noted that the mean Arithmetic subtest score forthe normal 
adolescent (ages 12 to 16) is systematically lower than his mean score 
on the remaining tests of the scale. 



156 


MEASUREMENT OF ADULT INTELLIGENCE 


The test pattern just described is based primarily on the performance 
of male adolescent psychopaths. Further experience has shown that 
it is also applicable to the adult male psychopath; but a study now under 
way® indicates that it may not have the same diagnostic value in the case 
of the female psychopath. There is seemingly an important sex differ- 
ence. One feature, however, common to both male and female adoles- 
cent psychopaths is a relatively low score on the Information test. This 
may be due in part to the educational retardation (truancy, etc.) often 
associated with adolescent delinquents, but the low Information never- 
theless holds up as a consistently reliable sign. 

4. Neurotics, The neurotic generally dqesjKKniyjpn^t^ts 
quire iipjaediate effort. He is inclined to look upon each task as a 
challenge and is apprehensive of the impression he may make on the 
examiner. The result is that he is often ‘'blocked’^ overcritical and 
erratic. These characteristics may be evidenced on any test but are 
best revealed by the neurotic’s performance on the Object Assembly and 
the Digit Span. Though the neurotic seldom puts the Object Assembly 
together in an absurd fashion his effort is characterized by a great deal 
of trial and error. The absence of absurd configurations is what often 
distinguishes neurotics from organic cases who also make low scores on 
this test. The neurotic’s successes and failures on the Digit-Span test 
are unpredictable. He often fails on an easier digit series and passes 
on a harder one; frequently too, he repeats as many and sometimes more 
digits backwards than he does forwards. 

Although the neurotic’s performance is generally lower than his ver- 
bal score, exceptions are not uncommon, particularly in hysterics and 
obsessives. Many of the latter are preoccupied with numbers and often 
do surprisingly well on both the memory span and on the arithmetic 
tests. Neurotics engaged in clerical work may also get rfilatively high 
scores on the Digit Symbol. The low score of the neurotic on the Pic- 
ture Arrangement test is frequently associated with lack of social alert- 
ness and reflects their common inability to deal with social situations. 

Neurotic anxiety often manifests itself in the subject’s hesitation, 
imexpected failures and balking at various test items. Patients with 
anxiety neuroses and neurotic depression require, as might be expected, 
continued reassurance. Neurotic patients generally overestimate their 
own intellectual ability. Test variability in neurotics is greater than 
that of normal subjects but less than that of psychotics. 

* By F. Halpem at the Bellevue Psychiatric Hospital. 
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5. Menial defectives. Mental defectives do not ordinarily present any 
special diagnostic problem except where the social rather than the in- 
tellectual prognosis is the paramount issue, and in a small number of 
cases which have to be differentiated from simple schizophrenics. In 
differentiating between simple schizophrenia and mental deficiency, 
the most critical tests appear to be Arithmetic, Similarities and Block 
Designs. Schizophrenics may on occasion get high scores on any or all of 
these tests, but the mental defective, almost never. These three tests 
also differentiate between borderline and the definite mental deficiency.® 
In a recent study comparing these two groups we did not find a single 
mental defective who attained a score equal to the lower limits of nor- 
mality. This is in contrast with what happens in the case of the Digits- 
Span and the Object Assembly tests on which defectives will not infre- 
quently obtain non-defective and even average scores. Mental 
defectives, especially older subjects, tend to do particularly well on the 
Vocabulary test. That is to say, the mean score of the group as a whole 
on this test is significantly higher than that of any other subtest, and this 
in spite of the fact that mental defectives generally score higher on the 
Verbal than on the Performance part of the scale. 

6. Intertest Variability. The notation large intertest variability^, 
particularly as it applies to schizophrenics, requires some special com- 
ment. In a recent study Magaret (18) has reported that the intertest 
variability of this group of patients on the Wechsler-BeUevue is not 
particularly large and, in any case, not significantly greater than that 
met with in mental defectives. These findings are not only different 
from our own observations but also contrary to the clinical experience of 
other investigators, both with the Bellevue and other scales. The dis- 
crepancy was hard to accoxmt for until further examination of Magaret's 
data showed that her findings were primarily due to the method used in 
calculating the intertest variability. This method consisted of sub- 
tracting the mean variability of each subtest from the mean of aU the 
tests, and then taking the average of these deviations as a measure of the 
variability of the group appraised. Now the effect of getting a measure 
of variability from a mean of the means is simply to obliterate the vari- 
ability arising from individual differences. An illustration will make 
clear how this effect is achieved: Suppose two subjects each have a total 

® A Study of the Sub-tests of the Bellevue Intelligence Scale in Borderline and 
Mental Defective Cases by D. Wechsler, H. Israel and B. Balinsky. American 
Journal of Mental Deficiency, 1941, XLV, 555-658. 



158 


MEASUREMENT OF ADULT INTELLIGENCE 


score of 70 on the FnU Scale and an hypothetical subtest distribution as 
follows: 


Subject A 

Subject B 


Information 

4 Information 

.... 10 

Comprehension 

5 7* Comprehension 

.... 9 

Arithmetic 

10 ; Arithmetic 

.... 4 

Digits 

3 Digits 

.... 11 

Similarities 

9 Similarities 

.... 5 

Picture Arr. . . 

9 Picture Arr 

.... 10 

Picture Comp. 

4 Picture Comp 

.... 10 

Block Design 

4 Block Design 

.... 10 

Object Assembly. 

12 ' Object Assembly 

.... 2 

Digit Symbol 

10 ‘ Digit Symbol 

.... 4 


The mean subtest score of each of the subjects is 7, and the mean 
deviation of the subtests from this mean 3.0, a deviation which of course 
is considerable. If the subtest scores of both are combined, the mean 
subtest score of the two subjects taken as a group is again 7, but the 
mean deviation from the mean becomes exactly 0. Thus the averaging 
of the individual subjects’ scores has served to obliterate the variability 
of the group. In estimating the subtest variability of a group 
it is necessary to first treat each individual separately, that is, 
first obtain the subtest deviations from each subject’s own mean and 
only afterwards average the deviations so obtained. When this is done, 
the variability of psychotics show’s up considerably greater than that of 
either normal or defective subjects. 

The data given in table 30 will be valuable to its user in proportion to 
his familiarity with the Scale as a whole and his clinical acquaintance 
with the disease entities. They are not intended, nor can they be used 
as psychometric short cuts to psychiatric diagnosis. But they can be of 
considerable differential aid if the examiner is aware of the statistical 
and nosological problems which all such tables present. Thus, a careful 
examination of the table will show that the same tests are recurrently 
failed by different clinical groups. For example, all groups except the 
psychopaths and mental defectives do less well on performance than on 
verbal tests. Again, Digit Symbol as a “do-bad-on” test is common to 
the neuroses, organics and many psychoses; and nearly all subjects 
with mental disorders do relatively well on Vocabulary and Information. 
Accordingly, it is not merely the incidence of a given “sign” but the 
character of the other “signs” associated with it in combination, that 
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determine its diagnostic significance. In general, the problem of diag- 
nosis by test score differentiae resembles medical diagnosis through 
physical signs, A listing of mere symptoms met with in different dis- 
eases would show relatively little correlation between any one sign and 
any one specific disease entity, for the reason that the same symptoms 
are met with in many different diseases. An elevated temperature is 
characteristic of pneumonia, erysipelas, abscess of the arm, etc. None, 
therefore, would make a diagnosis of pneumonia on high temperature 
alone. However, if the high temperature were accompanied by con- 
solidation of the lungs, it would quite obviously indicate sii active pneu- 
monia; if it were accompanied by a ‘‘butterfly” rash with raised edges, 
erysipelas, and so on. 

In addition to assuming a- correlation between “sign” and disease 
entity, the diagnostic technique just illustrated also assumes some 
differential or critical score method of evaluation.^ But as the reader 
will have observed, no attempt has been made to furnish such critical 
scores for any of the disease entities listed. There are several reasons 
for this omission. In the first place, none has been statistically vali- 
dated. But even if this had been done, the writer would have been 
extremely loathe to present such critical scores for fear of conveying the 
impression that all that is necessary to make a diagnosis is an ability to 
look up a table of interpreted scores. That such apprehension is not 
unwarranted, is attested by the wide incidence of such tables furnished 
by the authors of personality, neurotic inventory and delinquency scales. 
A more important reason for omitting critical scores which might be 
used directly for diagnosis is that our list of “signs” is concerned only 
with the presence or absence of significant test scores. Such simple 
dichotomy obviously does not take into consideration the factors of 
configuration and of degree of concomitance of “signs,” which, as we 
have just noted, are at the very root of the problem of valid diagnosis. 
The appraisal of concomitance and interpretation of configurations 
cannot be made through simple statistical procedures, and at this stage 

^ The method of counting and subsequently interpreting “signs” suggested 
above is, of course, only one of several possible ways of evaluating the sort of data 
we have been dealing with. Instead of treating the individual test scores sepa- 
rately, an obvious next step would be to try combinations of certain selected tests, 
as for example, the sum of the Object Assembly and Picture Arrangement veraus 
the sum of the Block Design and Picture Completion test, etc. Or, instead of 
dealing with sums, one might use ratios between the scores of certain tests or com- 
bination of tests, as I. Rabin has done, (Test Score Patterns in Schizophrenia and 
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of our knowledge, is provisionally best replaced by clinical judgment 
and experience. 

In general, the greater the number of “signs” associated with any 
given mental disorder manifested by a psychometric, the greater the 
probability of a correct diagnosis being made on the basis of the ex- 
amination. However, the presence of only one or two “signs” when 
very marked, may be definitely pathognomic. For example, a very 
low Block Design score combined with a very low Object Assembly 
test score is definitely indicative of organic involvement, although some 
of the other signs may be absent. Sometimes, the “signs” are more 
numerous but not very decisive, and in these cases adjuvant aids must 
be taken into consideration. For example, mental defectives occasion- 
ally show test score distributions very much like that found in psycho- 
paths. In such a case, the global intelligence rating of the individual 
may well be the most important differential determinant. For ex- 
ample, if along with the other “signs”, the subject had an I.Q. of 65, the 
chances would be that we were having to do primarily with a mental 
defective, although presence of the added “signs” would indicate that 
we were dealing at the same time with an unstable and not easily ad- 
justable defective. 

On the other hand, it is sometimes extremely useful to distinguish 
between what might be termed “soft” and “hard signs”. “Soft signs” 
are differences in test scores which deviate only slightly from the normal 
or expected mean, but which are nevertheless similar in direction to 
those met with in given disease entities. “Hard signs” are test score 
differences which show marked divergence from the normal or expected 
mean score. “Soft signs” are ordinarily passed over because they are 
within the normal range of variation. But their systematic presence is 
very important because they sometimes enable one to make a diagnosis 
before definite or full blown physical symptoms are in evidence. A 
striking case o^ this kind was that of a 19 year old boy who was admitted 
to the hospital after an attack of dizziness and transient loss of memory. 
He had experienced several similar attacks before, which the mother 
described as of short duration. She said, “He [the patient] did not seem 


Non-Psychotic States, April 1941 meeting of Eastern section of A.P.A.) and as 
we have suggested in the discussion of indices of mental deterioration (p. 66). 
Finally, there is the technique of the multiple regression equation. This would 
be the most valid method of all, but its application to the problem of diagnosis 
would be beset with many difficulties, not the least of which would be the inability 
of most clinicians to apply it. 
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to remember and couldn’t get himself together”. The only event which 
the patient himself associated with his last attack was that on the night 
before he had “drunk two coca-colas”. A neurological examination on 
admission was essentially negative. He improved very quickly and was 
discharged after a short period of observation with a tentative diagnosis, 
largely based on the patient’s attitude toward his illness, of conversion 
hysteria. During this first stay, a psychometric examination (Wechsler- 
Bellevue) revealed a number of “soft signs” which led the psychologist to 
suggest the possibility of organic involvement. As there was no medical 
substantiation at the time of an organic brain disease, no special regard 
was paid the psychologist’s observation. Three months later the 
patient was readmitted to hospital with a fullblown brain tumor. 

We shall now present a series of cases to illustrate the diagnostic 
possibilities of the method above described, making use, so far as pos- 
sible, of the tabular data given on pages 150-151. 

ILLUSTRATIVE CASES 

Organic Brain Disease 


Information 14 Case 0-1 . Male, age 34, showing definite neuro- 

Comprehension .Jir.. 12 logical s ns including marked hydrocephalus, 

Arithooetic O 9 facial weakness, slight tremor, absent abdominals. 

Digits 13 Also suggested Babinsky on left side with mild 

Similarities 5k X.. 11 postural deviations on same side. Diagnosis- 

Verbal 59 post meningo-encephalitic syndrome. At age 

Picture Arr ^ 9 of 6 months patient had an injury with sequelae 

Picture Comp.. . . 1 .T. . . 8 lasting 6 months, which was diagnosed as menin- 

Block Design .T 4 gitis. This case shows the four most conspicuous 

Object Ass Z.t: 1 signs of organic brain disease: large discrepancy 

Digit Symbol. . . .T 3 between Verbal and Performance in favor of the 

Performance 25 former, very low Blocks combined with even lower 

Object Assembly and very low Digit Symbol. 

Verbal I.Q 115 While all of the test scores on the verbal part of 

Performance I.Q 74 the examination are average or above, the two 

\ ^ lowest are Similarities and Arithmetic which are 

Digits forward ... 8 in line with the organic picture. The only excep- 

Digits backward 6 tion is the Digit Span which is good for both 

forwards (8) and backwards (6). 

Vocabulary 10 CaseO-B. Male, married, age 54, fireman. En- 

Information 11 tered hospital with complaints of headaches and 

Comprehension. ........ 11 forgetfulness. History of old skull fracture. 

Arithmetic. 6 Physical examination, including blood-Wasser- 

Digits 9 mann and flat plates were negative. While under 
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Similarities 7 observation referred by Neurological service for 

Verbal 45* psychometric. On Bellevue Adult patient at- 

Picture Arr 7 tained a rating of average intelligence with sub- 

Picture Comp 10 test score distribution as indicated in summary at 

Block Design 7 left- The distribution shows the following or- 

Object Ass 11 ganic signs: Arithmetic — low, Similarities — poor, 

Digit Symbol 4 Blocks — poor, Digit Symbol — very poor. Digit 

Performance 39 Span appears to be unimpaired, but actually 

average score of 9 was due primarily to subject's 

Verbal I.Q 103 relatively good span for digits forward (7); back- 

Performance I.Q 103 wards span was only 4 digits. A very low digit 

backwards in a person of otherwise normal in- 

Digits forwards 7 telligence is generally characteristic of organic 

Digits backwards ... . 4 brain disease. This patient, when his age is 


considered, seemingly manifests relatively little** 
deterioration but is of special interest because 
he shows many of the signs of organic brain dis- 
ease; the one conspicuous e.xception is the rela- 
tively high Object Assembly score; but a good 
Object Assembly is occasionally found in organic 
cases who have not as yet deteriorated markedly. 


Vocabulary 7 Case OS, White, male, adolescent, age 14, 

Information 8 Brought to hospital because of marked change in 

Comprehension 9 personality. Had been normal boy until 6 months 

Arithmetic 7 prior to admission. Illness first manifested by 

Digits 6 failure at school and increased irritability. Phys- 

Similarities 9 ical and neurological examination on admission 

Verbal 39 essentially negative. Case presented to illustrate 

Picture An* 6 value of Scale in detecting possible organic brain 

Picture Comp 1 conditions prior to manifestation of neurological 

Block Design 1 symptoms. Psychometric organic signs are: 

Object Assembly 2 Verbal much higher than Performance; very low 

Digit Symbol. 5 scores on both Object Ass. and Block Design; large 

Performance 15 discrepancy between Digits Forwards and Digits 

Backwards. On the qualitative side, subject 

Verbal I.Q showed conunon organic manifestation of being 

Performance I.Q able to reproduce designs if presented with a 

model of assembled blocks (see p. 92), after fail- 

Digits Forwards 6 ing completely with the usual form of presen ta- 

Digits Backwards 3 tion. 


Schizophrenia 

A 

Comprehension. , .y 10 CaseS-1. Male, age 41, war veteran, 3 years 

Arithmetic .9 10 college. Long history of inadequacy and mal- 

♦ When Vocabulary is included, the Verbal score is obtained by taking 5/6 
of total. 

** ActuaDy it is very considerable. See p. 67. 
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Information. . .. 


15 

Digits 

, ..I.. 

.... w ... * 

14 

Similarities 


11 

Verbal 


60 

Picture Arr 


8 

Picture Comp.. 

P 

13 

Block Design. . 


10 

Object Ass 


•12 

Digit Symbol. . 


4 

Performance. 


47 

Verbal I.Q 

116 

Perform. I.Q... 


95 


adjustment. Has been at various mental hospi- 
tals and repeatedly diagnosed as paranoid 
praecox. Latest admission followed threats of 
violence to mother. Well preserved, no hallucina- 
tions or delusions elicited; but insists that mother 
and veterans hospitals are against him. Patient 
is of better than average intelligence. The 
schizophrenic signs in his psychometric are; 
Verbal much higher than Performance, low 
Picture Arrangement, very low Digit Symbol 
simultaneously with high Digit Span, large dis- 
crepancy between Information and general Com- 
prehension in favor of the former. High intertest 
scatter within Performance group. 


Information 10 

Comprehension 6 

Arithmetic 6 

Digits 6 

Similarities 1 

Verbal 29 

Picture Arr 4 

Picture Comp 10 

Block Design 1 

Object Ass 2 

Digit Symbol 2 

Performance 19 

Verbal I.Q 83 

Perform. I.Q. 71 



Vocabulary 10 

Information 10 


Case S-S, White, male, age 39, elevator oper- 
ator. This patient shows marked deterioration, 
generally seen only in old cases but occasionally 
also in cases of relatively short duration. In his 
case, reported onset of disease was about six 
months prior to administration of test. First 
indication that something was radically wrong 
with patient occurred when he left his job with 
no apparent reason; said he was nervous and had 
no peace of mind. Later complained police were 
after him. On admission to hospital was bewil- 
dered, kept to himself but was passively coop- 
erative. Although diagnosed as paranoid 
schizophrenic, general behaviour was that of a 
simple or mixed type. Psychometrieally, he 
showed the following schizophrenic signs: Verbal 
higher than Performance, low Digit Symbol with 
much better Digit Span, low Object Assembly, 
zero scores on Similarities and Block Design, 
high Information- Most outstanding of all, 
very large intertest variability ranging from a 
score of zero to a score of ten. The very low 
scores on the Object Assembly and Digit Symbol 
together with zero score on the Block Design 
taken alone would suggest organic brain disease, 
but in that case we would also get a low Picture 
Completion and not an average score on this test. 
The inconsistency here is what definitely shows 
this case to be schizophrenic. Similarly, only a 
schizophrenic would give an average score on 
Information and a zero score on Similarities. 

Case j 8-5. White, female, age 38. Admission 
following arrest for refusing to take shelter during 
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Comprehension 7 

Arithmetic 9 

Digits 11 

Similarities 6 

Verbal 43 

Picture Arr 11 

Picture Comp 10 

Block Design 13 

Object Assembly 6 

Digit Symbol 13 

Performance 53 

Verbal I.Q 97 

Performance I. Q 113 


Comprehension 11 

Arithmetic 6 

Information 10 

Digits 6 

Similarities 5 

Verbal 38 

Picture Arr 12 

Picture Comp 10 

Block Design 15 

Object Ass 16 

Digit Symbol 12 

Performance 65 

Verbal I.Q 90 

Perform. I.Q 123 

Comprehension 8 

Arithmetic 6 

Information 6 

Digits 6 

Similarities 6 

Verbal 32 

Picture Arr.. 10 

Picture Comp 8 

Block Design 7 

Object Ass 11 

Digit Symbol 9 

Performance 45 

Verbal I.Q 82 


blackout raid and committing commotion. His- 
tory of previous commitment to State Hospital. 
On admission agitated, hallucinatory, seclusive, 
manneristic; threatens to kill doctor. Psycho- 
metrically of interest as a case of schizophrenia 
with Performance higher than Verbal: also as a 
case which though failing to show many of the 
listed signs, nevertheless reveals a test pattern 
which is definitely schizophrenic. The diagnos- 
tic signs present are a combined low Similarities 
and low Comprehension; Object Assembly much 
below Block Design; sum of Comprehension and 
Object Assembly much lower than sum of Infor- 
mation and Block Design; large intertest vari- 
ability. 

Adolescent Psychopaths 

Case P-1. White, male, age 15, 8th grade. 
Continuous history of stealing, incorrigibility 
and running away. Several admissions to Belle- 
vue Hospital, the last one after suicidal attempt. 
While on wards persistently created disturbances, 
broke rules, fought with other boys and con- 
tinuously tried to evade ordinary duties. Psy- 
chopathic patterning; Performance higher than 
Verbal, low Similarities, low Arithmetic, sum 
of Picture Arrangement plus Object Assembly 
greater than sum of scores on Blocks and Picture 
Completion. 


Case P-S. Male, age 16, schoolboy, 8th grade. 
Readmitted for observation from Correctional In- 
stitution where he was reported to have thrown 
acid into face of one boy and was suspected of 
pushing another into an open fire. History of 
outbursts, of violent temper and cruelty from 
early childhood. Would throw objects at least 
provocation and took pleasure in inflicting pain 
on others. Once pushed a boy, who he knew 
could not swim, off dock in hope of witnessing a 
drowning. Psychopathic patterning; Perform- 
ance higher than Verbal, good Object Assembly, 
sum of scores on Object Assembly and Picture 
Arrangement greater than sum of scores on Block 
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Perfonaance I.Q 92 Design and Picture Completion. This subject 

does not show all the psychometric signs of the 
psychopath but the sum of Object Assembly plus 
Picture Arrangement is markedly higher than that 
of Picture Completion plus Block Design, and that 
in itself is almost pathognomonic. 


Information 6 Case P-S. White, male, school-boy, age 14, 

Comprehension 9 8th grade. Boy a behaviour problem since age 

Arithmetic 7 of 3. Never responded to either cajoling or 

10 punishment. Brought to court for stealing, stay- 

Similarities 6 ing away from home days at a time and con- 

Verbal 38 tinued incorrigibility. Psychiatric diagnosis: 

Picture Comp 12 Psychopathic personality. This subject shows 

Picture Arr 13 almost all the psychometric signs for this diag- 

Block Design 11 nosis. Performance is higher than Verbal; sum 

Object Ass 13 of Picture Arrangement and Object Assembly is 

Digit Symbol 9 greater than sum of Picture Completion and Block 

Performance 58 Design. Information is low and Similarities low. 

Digit Symbol just at mean of all subtest scores 

Verbal I.Q — 90 but still lower than any of Performance. High- 

Performance I.Q 115 est Verbal score is on Digit Span. This is not 

typical of all psychopaths but rather characteris- 
tic of a certain group who have “good memory’'. 


Neurotics 

Comprehension 14 Case iV-1. Male, age 25, college graduate, 

Arithmetic 13 secretary. Complaints: easy fatigability, lack 

Information 14 of concentration, sexual difficulty, and general 

Digits 10 S 3 mptoms associated with Neurasthenia. Has 

Similarities 13 been under psychiatrist’s care. Neurotic pat- 

Verbal 64 terning: Verbal higher than Performance, high 

Picture Arr 10 Comprehension and Information with relatively 

Picture Comp 14 low Digit Span. Low Picture Arrangement and 

Block Design 15 low Digit Symbol, Object Assembly less than 

Object Ass 12 Block Design. Negative sign: relatively good 

Digit Span 9 Arithmetic, but in this connection account must 

Performance 60 be taken of fact that subject is a college graduate 

with B.S. degree. 

Verbal I.Q 120 

Performance I.Q 116 

Comprehension.. 10 CaseN-2, Female, age 15, school girl, High 

Arithmetic 6 School — Ist term. Difficulties began when sub- 
information 10 ject entered high school. Commercial courae. 

Digits 7 which subject has been failing, obviously un- 

Similarities 7 suited to girl’s ability. Main symptoms: gas- 

Verbal 40 trie distress and vomiting, unhappiness at school. 

Picture Arr 7 Physical examination essentially negative. Diag- 
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Picture Comp 9 nosis: Conversion Hysteria. Neurotic pattem- 

Block Design. 10 ing: poor Arithmetic and Digit Span with 

Object Assembly 7 relatively good Information and Comprehension, 

Digit Span 8 low Picture Arrangement, Object Assembly less 

Performance 41 than Block Design. However, total Verbal and 

Performance scores are not markedly different. 

Verbal I.Q 93 This is not uncommon in duller neurotics (es- 

Performance I.Q.... ^7'" pecially of Hysterical type). Also to be taken 

""7 ' into account is the fact that the subject comes 

from bilingual home. Low Similarities in this 
case only an indication of subject’s linaited 
intelligence. 


We shall conclude our discussion of the diagnostic material elicitable 
from the Bellevue Scale by adding a few notes on what might be termed 
“qualitative indicators”. These indicators are significant items which 
reveal themselves either in the form or in the content of the subject's 
responses. The perseverative, redundant quality of the schizophrenic’s 
definitions on the Vocabulary test is an example. Another is the nega- 
tivism which reflects itself in a subject’s tendency to say or to do the 
opposite of what is requested. This is frequently brought out by the 
subject’s responses to the Similarities test on which, instead of giving the 
asked for likenesses, he responds with gratuitous differences. The 
tendency to give differences when likenesses are called for, is also en- 
countered in subjects of limited intellectual endowment (e.g., young 
children and mental defectives). But when given repeatedly by a 
person of above dull-normal or better intelligence it is almost always 
pathognomic of schizophrenia. The following, for example, are the 
responses to the Similarities test questions, made by schizophrenic 
patient (case S-2) who insisted upon giving differences even after re- 
peated correction, and ended by making a zero score on the test: 
Orange-banana: orange is round, banana is long 
Coat-dress: a coat hangs over the dress 
Dog-lion: a lion is larger 

Wagon-bicycle: wagon has four wheels, bicycle has two wheels 
Daily paper-radio: you hear news on radio, read stories in newspaper 
Wood-alcohol: alcohol is white, wood is any color 
and so on lor all twelve test items. 

Sometimes a patient merely persists with a stereotyped reply, “they 
are not alike”. In such cases it is generally impossible to change his 
set, but even when one succeeds in so doing, the usual result is to elicit 
a difference instead of a likeness. For example, one patient after sys- 
tematically answering “not alike” to the first four Similarities questions, 
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altered his reply after much urging to the fifth question (Newspaper- 
radio) “one is for news, the other is for entertainment.’’ 

A common characteristic of schizophrenics and occasionally of certain 
types of psychopaths is the “spoiled” or contaminated response. The 
subject first gives a good or passing response, but then “spoils” it by an 
irrelevant or eccentric addition. Such responses are most frequently 
elicited by the Comprehension questions, occasionally, also, by the 
Similarities questions. Thus, Why are shoes made of leather? — “It fits 
(the foot) easily, it is an old custom.” (schizophrenic) Why does land 
in the city cost more than land in the country? — “In the city land be- 
comes more valuable because people need space while in the country 
acres are not so much in demand. In that way the law of supply and 
demand becomes counterbalances.” (schizophrenic) What is the thing 
to do if you find a letter^ etc? — “Might mail it, but first I’d open it up 
to see if there was any money in it.” Suppose there wasn't any money 
in it? — “Tear it up.” (psychopath). 

Sometimes abnormal trends are indicated by the question which a 
subject may ask. Thus, before answering the arithmetical problem, 
If a man buys 6^ worth of stamps and gives the clerk 10^ how much 
should he get backf^ a schizophrenic wanted to know whether they were 
“2 or 3 cent stamps”. At other times a casual question by the examiner 
will be equally effejictive in eliciting a telltale response. The following is a 
rather amusing example. How far is it from New Y ork to Paris? Patient : 

don ’t know.” Examiner: ‘‘Try to figure it out.” Patient: “Well, it 
takes about a week to get from Paris to New York. There are seven 
days in a week and twenty four hours in a day; so multiply 24 by 7 and 
you get 161 which equals the hours in* seven days or one week. Now 
there are twenty blocks in a mile; so multiply 161 by 20, and this gives 
you 3,220. The distance from Paris to New York is 3,220 miles.” 
The arithmetic is not quite accurate but more important than the slight 
error in multiplication is the way in which the subject arrived at the 
almost correct answer. The response was made by a schizophrenic 
with manic features. 
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THE BELLEVUE INTELLIGENCE TESTS 

GENERAL INSTRUCTIONS 


In administering the tests it is absolutely essential that the examiner 
follow directions as given. Until he has memorized the directions in 
their entirety, the examiner should read them from the manual. The 
examiner should not engage the subject in an interview while giving 
the tests; the only comments allowable are such as are needed to en- 
courage the subject. Directions may be repeated as often as necessary 
but not explained. If the subject does badly on any given test, say, 
‘‘That was a little hard. Lei us try another which is easier, and proceed 
with a test on which he is likely to succeed. The tests need not be 
given in the exact order listed; for adults a good test to start with is the 
General Information: with children it is well to begin with the Object 
Assembly. 

The Bellevue Intelligence Examination consists of 10 tests, 5 of which 
are described as Verbal and 5 as Non-Verbal, or Performance. Where- 
ever practicable it is desirable to give the Full Scale. This is particu- 
larly true in the case of adolescents where the examiner may wish to 
use the results for vocational guidance. There are three exceptions to 
the rule: First, cases where the subject suffers from some incapacitating 
physical handicap or defect (blindness, paralysis, etc.). In such in- 
stances, of course, only the Verbal part of the scale will be used. Sec- 
ond, illiterates and foreigners; with such subjects, only the Performance 
part of the scale may have to be used. We say may because our ex- 
perience has shown that, even when subjects have a limited English 
vocabulary, they are not likely to be penalized to any great extent when 
given the Verbal tests. Third, individuals 50 years and older. In the 
case of older persons it is sometimes necessary to omit one or another of 
the tests in order to avoid penalizing these subjects for certain disabili- 
ties such as diminished vision or partial deafness. With such subjects 
a battery of 8 or 9 tests may be substituted for the Full Scale. Wlien 
this is done one prorates for the omitted tests. But on such occasion 
omission of any test should be decided upon before the test is given, not 
afterwards. It is not permissible to drop out a t^t simply because a 
subject does badly on it. In any case, no rating should be given on a 
‘TulP^ scale examination consisting of less than 8 tests; in most cases at 
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least 9 tests should be employed. Our own experience has shown that 
one seldom has to omit more than one test. 


DIRECTIONS FOR THE INDIVIDUAL TESTS 
1. General Information 

Directions: Read questions as stated and in order^ given. The first 
question '^Who is President of the United States?’’ is used as a buffer 
and not scored. Most of the questions are easily marked right (+) 
or wrong (— ), but the examiner should familiarize himself with the 
right answers. Ordinarily, it will be possible to score a response di- 
rectly as plus or minus, but when the subject’s answers are doubtful 
they should be written out. It is always legitimate to say ^'Explain 
more fully but not to give leading questions.® 

The questions with their correct answers are as follows: 


(a) Who is the President of the U. S.? 

1. Who was President before him? 

2. What is a thermometer? (5) 

3. What does rubber come from? (4) 


4. Where is London? (2) 

How many pints make a quart? (3) 

How many weeks are there in a year? (6) 

7. What is the capital of Italy? (7) 

8. What is the capital of Japan? (16) 

9. How tall is the average American wo- 

man? (9) 

10. Who invented the airplane? (10) 

11. Where is Brazil? (12) 

12. How far is it from Paris to New York? 

( 11 ) 


(Franklin) Roosevelt® 

(Herbert) Hoover® 

(Instrument, thing, etc.) for 
measuring temperature. 

Trees; sap of tree; rubber 
plant (if subject does not 
mean house variety)* 

England 

Two 

52 

Rome 

Tokyo 

Any answer from 5' 3^ to 5' 6^ 
(inclusive) 

Wright Brothers (or Langley) 
South America 

2000 to 3500 miles (any figure 
within limits) 


1 The order of the questions is different from that given in the first and second 
editions. The changes were made upon the basis of supplementary data com- 
municated to the author and, more especially, of an item analysis made by Lt. Wm. 
D. Altus, AGD, U.S.A. The new ranking corresponds more nearly to the diffi- 
culty of the items for adults, at the present time (1944). The original order of 
the items is indicated by the numbers in parenthesis following each question. 

* For example; If the subject answers *Tt beats” to the question "What does 
the heart do?” the examiner may not say "And what else?”. 

* The correct responses will naturally have to be altered with time. 

^ Petroleum and alcohol may also be credited plus. 
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13. What does the heart do? (17) 

14. Who wrote Hamlet? (13) 

15. What is the population of the U. S. (18) 

16. WTien is Washington’s birthday? (8) 

17. Who discovered the North Pole? (14) 

18. Where is Egypt? (20) 

19. Who wrote Huckleberry Finn? (19) 

20. What is the Vatican? 

21. What is the Koran? 

22. Who wrote Faust? 

23. What is an Habeas Corpus? 


24, What is Ethnology? 

25. What is the Apocrypha? 


Pumps blood (through the 
body) 

Shakespeare 

100 to 150 million (any figure 
within limits) 

February 22nd ^ 

Peary 

Africa 

Mark Twain (or Samuel 
Clemens) 

Home of the Pope or papal 
government 
Mohammedan Bible 
Goethe (or Gounod, if opera is 
specified) 

A writ requiring a person in 
custody to be brought before 
the court 
or 

A writ obtained by a person to 
demand of the court the rea- 
son for his detention and (if 
no sustaining charge is forth- 
coming) that he be freed 
The study of the origin of races 
Books of disputed authority in 
Old Testament 
or 

The reputedly unauthenticated 
or unacceptable versions of 
the Bible 


Scoring: Each response is scored plus or minus. Ordinarily it will 
sufldce to continue imtil 5 consecutive questions are successively failed. 
But unexpected successes are not uncommon and occasionally it may be 
necessary to disregard this rule and give the entire list. If, for some 
reason, the examiner suspects that a subject may be acquainted with 
the correct answer to any particular question, he should ask it, even 
though the item is below the 5 succ^ive failure point. The total score 
is the sum of plus responses. 

Maximum score: 25 


2. General Comprehension 

Directions: Be sure that the subject is attending when you give the 
questions. Very youi^ and very old subjects sometimes find it diffi- 
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cult to remember the entire question from a single statement of it. It 
is, therefore, always permissible to repeat the question. In fact, it is a 
good practice to repeat the question, if no response is obtained after 10 
or 15 seconds, but do not abbreviate or alter the wording. Record 
subject’s responses verbatim. Sometimes it is necessary to encourage 
the subject. It is done with such remarks as, “(zo ahead” 

etc. If response is not clear, add ^'Please explain further,” Except 
for low grade individuals, ask ail the questions. 

QUESTIONS 

. 1. What is the thing to do if you find an envelope in the street, that is sealed, 
and addressed and has a new stamp? 

- 2. What should you do if while sitting in the movies (theatre) you were the 

first person to discover a fire (or see smoke and fire)? 

3. Why should we keep away from bad company? 

- 4, Why should people pay taxes? 

~ 5. Why are shoes made of leather? 

_6. Why does land in the city cost more than land in the country? 

If you were lost in a forest (woods) in the daytime, how would you go about 
finding your way out? 

8. Why are laws necessary? 

9. Why does the state require people to get a license in order to be married? 

10. Why are people who are born deaf usually unable to talk? 

Two alternate questions are added in case a question is invalidated. 
It is not permissible to substitute the alternates for the above, merely 
because a subject has failed on a particular question. 

AUemates: 

Why should we save money? (For questions 1-3.) 

Whiat should you do if a dear friend of yours asks you for something you 
haven’t got? (For questions 4-10.) 

Scoring: Refer to Criteria sheets for answers. The responses are 
scored either 0, 1 or 2, depending upon the degree of generalization and 
the quality. In some cases the scoring is fairly obvious, but in others 
frequently di ffi cult. The examiner should match his responses against 
the sample answers given on the scoring sheets. The samples are 
drawn from more than a thousand cases, but from time to time the 
examiner may find some imusual response which is not typified. In 
these cases he will have to use his own judgment. But it is not permis- 
sible for him to disagree with the credits allowed on the scoring sheets, 
because the norms are based on the credits as assigned. Total score, 
is the sum of the credits on the 10 questions. 

Maximum score: SO, 
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3. Arithmetical Reasonixg 

Directions: There are 10 problems, the first 8 of which are given 
orally and the last 2 handed to the subject on a printed card with the 
instructions, ‘^Now read this one aloud and do the 'problem mentally 
In the case of young children and subjects likely to be defective, it is 
well to begin with the first example, but with bright children and aver- 
age adults it is better to begin wdth a problem which the examiner has 
reason to believe the subject will solve readily, but which is not “too 
easy’’. With the average adult one may begin with problem 3, allow- 
ing credit for the easier problems. Continue dowmward until three 
successive examples are failed. In presenting the first problem say, 
“J want to see how good you are in arithmetics^ The w^ording of the 
questions may not be altered, but it is always permissible to repeat the 
problem. In fact, it is desirable to do so to make sure that the sub- 
ject is not forgetting it in the process of trying to solve the example. 
The first five problems are solved almost immediately by subjects who 
know the correct answers, but liberal time is allowed (see time limits). 
If the subject does not answer within the time limit, proceed with the 
succeeding questions, saying, ^^Now try this one.” (Record time in the 
case of the last two problems.) 

The time limits are 15 seconds on questions 1, 2 and 3; 30 seconds on 
4, 5 and 6; 60 seconds on 7 and 8; and 120 seconds on 9 and 10. Time 
credit is allowed only on the last two problems. 

Scoring: Credit of 1 for each problem answered correctly wdthin 
time limit; give additional credit of 1 for time on problems 9 and 10, 
if answer is given within 40" and an additional credit of 2 if answer is 
given within 15". (Ti m ing to begin after subject has finished reading 
problems.) 

Maximum score: 14 ^ 


PROBLEMS 

Time t Answer 

(15'0 1. How much !§ four dollars and five dollars? 9 

(15^) 2. If a man buys six cents worth of stamps and gives the clerk 

ten cents, how much change should he get back? 4 

(15"") 3. If a man buj^ eight cents worth of stamps and gives the 
clerk twenty-^ve cents, how much change should he get 
back? 17 

(300 4. How many oranges can you buy for thirty-six cents if one 

orange costs four cents? 9 

(30^) 5. How many hours will it take a man to walk twenty-four 

miles at the rate of three nules an homf S 
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Time Answer 

(30") 6. If a man buys seven two cent stamps and gives the clerk a 

half dollar, how much change should he get back? 36 

(60") 7. If seven pounds of sugar cost twenty five cents, how many 

pounds can you get for a dollar? 28 

(60") 8. A man bought a second hand car for two-thirds of what it 
cost new. He paid $400 for it. How much did it cost 
new? $600y 

(120") 9. If a train goes 150 yards in ten seconds, how many feet can 

it go in one fifth of a second? 9* 

(120") 10. Eight men can finish a job in six days. How many men will 

be needed to finish it in a half day? 96 


4. Digits Forward and Backward 

These are administered as separate tests, but the number of digits 
correctly reproduced on each are added to give a total score. 

(4a) Digits Forward; Directions: Use set of digits given below. 
Say, am going to say some numbers. Listen carefully y and when I 
am through, say them right after me” If subject repeats series correctly, 
mark plus and continue with the next higher series. If subject fails, 
give second trial on series of equal length. Discontinue after subject 
has failed on both trials of a given series. Score is highest number of 
digits repeated without error on either of two trials. Thus, if subject 
repeats correctly 5 digits forward, he gets a score of 5. Read digits at 
rate of one per second. 

(4b) Digits Backward: Say, ^^Now I am going to say some more 
numbers, hut this time when I stop, I want you to say them backwards. 
For example; if I say 7-1-9, you say (Pause) 9-1-7, If subject gives them 
correctly, say, ^‘Thafs right,” If subject does not seem to understand, 
use another example. Always begin with the 3 digit series, continuing 
imtil subject fails both trials of the series. If subject fails the 3 digit 
series, it is permissible to give him the 2 digit series and allow him a 2 
score if he is able to reverse them. Only use the 2 digit series when 
there is failure on both of the 3 digit series. Score is the highest number 
of digits correctly reversed. 

The total score on tests is the sum of the number of digits repeated 
forward and backward. Thus, if subject repeats 7 forward and 4 
backward, his total score is 11. 

Maximum score: 17, 

If subject answers, “3 yards’’, say, 'Vead it again”] if answer is corrected to 
9 feet, score plus. 
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DIGITS FORWAED 


(3) 

5,8,2 

6,9,4 

(4) 

6, 4, 3, 9 

7, 2, 8, 6 

(5) 

4, 2, 7, 3, 1 

7, 6, 8, 3, 6 

(6) 

6, 1, 9, 4, 7, 3 

3, 9, 2, 4, 8, 7 

(7) 

5, 9, 1, 7, 4, 2, 8 

4, 1, 7, 9, 3, 8, 6 


5, 8, 1, 9, 2, 6. 4, 7 

3, 8, 2, 9, 6, 1, 7, 4 

tST 

2, 7, 5, 8, 6, 2, 5, 8, 4 

7, 1, 3, 9, 4, 2, 5, 6, 8 


DIGITS BACKWARD 


(2) 

(2,4) 

(5, 8) 

(3) 

6,2,9 

4,1,5 

(4) 

3, 2, 7, 9 

4, 9, 6, 8 

(5) 

1, 5, 2, 8, 6 

6, 1, 8, 4, 3 

(6) 

5, 3, 9, 4, 1, 8 

7, 2, 4, 8, 5, 6 

(7) 

8, 1, 2, 9, 3, 6, 5 

4, 7, 3, 9, 1, 2, 

(8) 

9, ’4, 3, 7, 6, 2, 5, 8 

7, 2, 8, 1, 9, 6, 5, 3 


5. Similarities 

Directions: Say, am going to name two things which are the same or 
alike in certain ways and I want you to tell me in what way they are alike^ 
For example: what way are an orange and banana the same? (Pause). 

If subject answers with the reply ‘^They are fruits,’^ say, ^Hhat is right, 
and proceed with the list. If his response is ^'They are not alike^\ the 
examiner adds, ^^They are alike in some ways; tell me one way in which they 
are alikeJ’ If subject fails to ans^wer within 10 to 15 seconds, expresses a 
difference or gives an inferior reply, the examiner says, ell, you might 
say you can eat them both, or that they both have shins, or that they are both 
fruitsJ^ Then proceed with the list as given. Continue until 4 succes- 
sive pairs are completely failed. Kecord answers verbatim. The follow- 
ing is the list of Similarities to be read: 


1 ^1. Orange 

Banana 

1 ^2. Coat 

Dress 

ys. Dog 

Lion 

1 1 4. Wagon 

Bicycle 

1 1 5. Daily paper 

Radio 

^ 0 6. Air 

Water 

b 0 7. Wciod 

Alcohol 

1 8. Eye 

Ear 

^0 9. Egg 

Seed 

^10. Poem 

Statue 

^ 11. ftaise 

Punishment 

0 12. Fly 

Tree 
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Scoring: Responses are scored either 0, 1 or 2, depending upon the 
degree and quality of the generalizations. Use criteria illustrations for 
evaluating the responses. The examiner matches his responses against 
the sample answers and scores accordingly. Sometimes the subject 
^‘spoils^’ an answer by an additional response. K the original response 
was scored 2, it reduces the score to 1. Spontaneous emendations are 
allow^ed. In such cases, the examiner asks, ‘Woti? which one is it?” 

Total score on Tests is the sum of the partial credits on the 12 questions. 

Maximum score: 2^. 

6. Picture Completion 

Directions: There are 15 cards in this Test, each of which has a part 
missing. The cards are presented to the subject in numerical order, 
and he is asked to name or indicate on each card the missing part. 
Prior to the presentation of the first card, the examiner says, am 
going to show you some pictures in which there is a part missing, I 
want you to look at each picture carefully and tell me what it is. Now look 
at this picture” (presenting card No. I). ^‘What important part is 
missing?” If correct answer is given, continue with succeeding pic- 
tures saying, ^‘Now what is missing in this one?” If subject fails to de- 
tect omission, indicate to him, thus, see, the nose is missing,” 

If he fails on the second picture also, he is helped again, thus, ^^You 
see here half of the mustache is missing,” But beginning with the third 
picture no further help is given, the examiner merely repeating the 
formula **Now what is missing in this picture?” at each successive 
presentation. Sometimes the subject mentions an unessential missing 
part. The time this occurs, the examiner may say, “Fes, hut what 
is the most important thing missing?” The examiner may not repeat® 
this comment for the remaining presentations. Allow a maximum 
exposure of 15^ to 20^ per picture. If the subject does not indicate the 
missing part within this time, score as failure and continue with the su- 
ceeding picture. Except for very low grade individuals, present the 
entire series of pictures. 

Scoring: The score is the number of pictures for which correct re- 
sponses are ^ven. In the case of picture 11 (the reflection of the arm 
in the mirror) if subject says, “leg is missing,” say, “Fes and what 
dsef” But if correct response is still not given, subject's response is 
scored as a failure. 

Maximum score: 15, 

• Except in case of card 11. See footnote on opposite page. 
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PICTURE COMPLETION 



Picture 

Missing Part ^ 

1. 

Girl 

Nose 

2. 

Man 

Mustache , 

3. 

Man 

Ear j 

4. 

Card 

Diamond 

5. 

Crab 

Leg 

6. 

Pig 

Tail 

7. 

Boat 

Stacks ^ 1 

Knob 1 

8. 

Door 

9. 

Watch 

Hand 

10. 

Pitcher 

Water ^ 

11. 

Mirror 

Reflection of arm 

^12. 

Man 

Tie ' 

13. 

Bulb 

Thread (or prong) ^ 

14. 

Girl 

Eyebrow- 

15. 

Sun 

Shadow of man 


7. Picture Arrangement 

Directions: Present first sample series in mixed order as indicated on 

back of the card. 

Say, “The pictures tell a story about a bird building 

its nesL As you see, they are in the wrong order, but if you put them in 
the right order — ‘Hike this ^^ — (the examiner starts rearranging the cards) 
— “the pictures will tell a sensible story. Watch me. In the first pic- 
ture the bird is building its nest, the next picture shows the eggs which 
the bird has laid, and the last picture shows the bird feeding its young 

which have been hatched (Pause). 

‘Wow I have some other sets of pic- 

tures which I want you to arrange. 

In each case they are mixed up and 

what I want you to do is to put them in the right order so that they make the 

most sensible story. 

Try this seV\ 

(E. presents set 1.) 


PICTURE arrangement 


Order of Presentation 

Correct order Time limit 

Sample Nest 

1, 2,3 


1. House 

1,2,3 

PAT 1 minute 

2, Hold Up 

1, 2, 3. 4 

A B C D 1 minute 

3. Elevator 

1, 2, 3, 4 

L M N 0 1 minute 

4. Flirt 

1, 2, 3. 4, 5 

JANET 2 minutes 

5. Taxi 

1, 2, 3, 4, 5, 6 

SAMUEL 2 minute 

6 . Fish 

1, 2, 3, 4, 5, 6 

E F G H I J 2 minutes 


* If S. says “buttons” (on shirt), E. says “Yes and what else”; and if S. then 
idves correct response score plus. 
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On the back of the pictures is indicated the order of arrangement 
for presentation, given by numerals 1, 2, 3, 4, etc., depending upon the 
number of cards. The correct order of rearrangement is indicated by 
.code letters. There are time limits for each picture, and in the case of 
the last two, “FISH’' and “TAXI", credit is given if they are completed 
vithin 40 seconds. 

Record subject’s actual arrangement by noting letters on back of 
successive cards, also note time required by subject to complete the 


TABLE 31 

ACCURACY CREDITS FOR PICTURE ARRNGEMENT 


Series 

Order 

Accuracy 

Credits 

Time Credits 

1. House 

PAT 

2 

None 

2. Hold Up 

ABCD 

2 

None 

3. Elevator 

LMNO 

2 

None 

4. Flirt 

JANET 

B 

None 


JNAET 

3 

None 


AJNET 

2 

None 

6. Fish 

EFGHIJ 

3 

As indicated in table 


EFHGIJ 

2 

As indicated in table 


EGFHIJ 

1 

None 


EFGIHJ 

1 

None 

6. Taxi 

SAMUEL 

3 

As indicated in table 


SAMEUL 

2 

As indicated in table 


SALEUM 

1 

None 


SALMEU 

1 

None 


SALMUE 

1 

None 


SALUEM 

1 

None 


TABLE 32 


Time bonus for “Fish” and “Taxi” (if accuracy scores are 2 or 3) 


Time in Seconds 
1-20 
21-30 
31-40 

, 41 and over 


Bonus CrediiM 

3 

2 

1 

0 


task. Some subjects after completing the arrangement pause to sur- 
vey it, and it is sometimes difficult to ascertain whether they have or 
have not finished. In such cases it is well to suggest to the subject, 
“TefZ me when you are finished 

Scoring: The first three series are scored either right or wrong. 
On the last three, partial credits are allowed for imperfect but neverthe- 
less sen^ble arrangement. In the case of the last two series additional 
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credits are given for satisfactory arrangements within various time 
limits. Use tables 31 and 32 for obtaining the different scores allowed. 
The final score on this test is the sum of partial credits received on the 
separate items. 

Maximum credits: 21, 



8. Object Assembly 

Directions: There are three different items on this test: a Manikin^ 
a Profile and a Hand, Present these in order named, making sure that 
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in each case the pieces are arranged as indicated on the diagrams. (See 
figures 10, 11 and 12.) 

(a) Manikin: Say, “7/ these pieces are put together correctly they will 
form something. Go ahead and put them together just as quickly as you 
can” Time limit, B minutes, 

(b) Profile: Say, this together as quickly as you can,” Time 

limit, 3 minutes. Record time in seconds. 

(c) Hand: Say, *'Put this together as quickly as you can,” Time limit, 
3 minutes. Record time in seconds- 



Scoring: Part of this test (the Manikin) is scored for accuracy alone; 
and part for both time and accuracy (the Profile and Hand). When 
both accuracy and time are credited, the subject’s score on the test 
items is the arithmetical sum of his time and accuracy credits. The 
scoring for each of the items is as follows: 

(a) Manikin: Score 1 point for each piece properly placed and 1 
point for neat fitting of ail joints. Perfect performance = 6; all cor- 
rect except that joints are not well fitted, score == 5; one reversal (of 
either arms or 1^) = 4; one reversal, and remaining pieces not prop- 
erly fitted = 3; two reversals, that is, both of arms and legs = 2; 
head only correctly in place 1. No time bonus. Maximum score 
on Manikin: 6, 
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(b) Profile: Accuracy credits: 1 point for each piece in correct posi- 
tion, making a maximum of 6 credits. Thus, nose, chin and eye pieces 
properly placed = 3; ear,^ nose and skull pieces in position = 4; both 
ear pieces alone, correctly in place = 2. 

Time credits: From 1 to 4 points, according to limits given in table 33. ♦ 
Time credits are allowed only if subject’s accuracy score is 5 or 6. 

Full score on Profile is the sum of accuracy and time credits. 

(c) Hand: Accuracy credits: 1 point for each piece in correct posi- 
tion and if attached to palm, making a possible maximum for accuracy 
alone, 6. 

Time credits: From 1 to 4 points according to limits given in table 33. 
Time credits are allowed only if subject’s accuracy score is 5 or 6. 

Full score on the Hand is the sum of accuracy and time credits. 

Total score on (a) (b) and (c) is the sum of the partial scores on all 
items. Maximum score: 26. 


TABLE 33 

TABLE OP TIME CREDITS 
For &txd 


Time in Seconds Credits 

19 and below 4 

20-29 3 

30-44 2 

45-64 1 

65 and over 0 


9. Block Design 

Directions: This requires a box of 16 cubes and 9 designs, two of 
which are samples. Remove four cubes from the box and say: ^^You 
see these blocks are 'painted different colors on the different sides. One 
side is bluCy one red, one white, one yellow, one red and white, one blue and 
yellow. And all the blocks are exactly alike'\ The examiner next places 
Demonstration Card I in front of subject and says: ^^Now I want you to 
make me a design vnth the blocks so that it will look like the drawing on this 
cardJ^ ‘^Notice that the drawing is all red; so, to make one just like it with 
these 4 blocks I would have to arrange them this way.^* (Illustrating.) 
^^Do you understand?” (Pause.) ^Well, let's try another. This one is 
more difficult.” Examiner assembles blocks to make design shown on 

* The ear is made up of 2 pieces; if inverted, but otherwise correctly fitted into 
proper space credit 1 point. 
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Card II, pausing to make clear to subject bow two half designs can be 
put together to make one solid patch of color. When reasonably sure 
that subject understands, say — ‘ Wott? you take these 4 blocks and put them 
together so ihey^U make a design {picture) like the one you see on this card” 
^'Go ahead” When subject has finished with design, remark, ^'That's 
pretty good” Brush up the blocks, place second design before him 
saying, ^^Now make me one like this” Proceed with cards in order until 
design (5) is reached when examiner takes 5 more blocks out and says, 

TABLE 34 

Number of 

Card Blocke 

Demonstration T and II 4 

Designs 1 to 4 4 

Designs 5 and 6 9 

Design 7 16 

TABLE 35 

TIME SCORES FOR BLOCK DESIGN (WHEN CORRECT) 


Design 

Design 4 

Time 

Credits 

Time 

Credits 

1^-5" 

3 

I'-IO" 

3 


2 

ll'-lS' 

2 

11^-15'" 

1 

16''-25'' 

1 

Design 5 

Design 6 

Time 

Credits 

Time 

Credits 

l^->25^ 

3 

1 ^- 40 ^ 

3 

26^-35^ 

2 

41^-50'' 

2 

36^-50^ 

1 

51^-80^ 

1 


Design 7 




Time 

Credits 



I'-SO' 

3 



81'-100' 

2 



101''-140'' 

1 



*‘Now make me one like this, using 9 blocks” When design (7) is reached 
examiner takes out the remaining 7 blocks and says, ^^Now make me one 
like this iLsing 16 blocks” Continue presenting designs in numbered 
order until subject has failed three consecutive ones. Help or further 
demonstration on designs other than that given on Demonstration Cards 
(I & II) is not permissible. The maximum time allowed for completing 
the various designs is shown in table 34. 

It is not uncommon for some subjects, particularly children, when 
making the first design either to rotate the pattern or to reverse color 
and background. When this happens say “Is this right, does it look 


Time Limit 

75 seconds 
150 seconds 
195 seconds 
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just like the design?'’, and pause for a while. Usually the subject will 
make a spontaneous correction or will do it if the instructions are re- 
peated. Reversals and rotations appear to be associated with reading 
disability, but with many children they seem to be ''normal" phenomena. 

Scoring: The test is scored for both accuracy and time. Three (3) 
points are given for each design correctly reproduced within maximum 
time limits; and one (1), two (2), or three (3) additional points for suc- 
cess within certain times as indicated in table 35. To receive credit 
for tune, the design must be reproduced just as shown on the card. 
Blocks do not have to fit perfectly together, but the design must be 
absolutely correct. 

Maximum score: 42, 

10. Digit Symbol 

This test is taken from the Army Performance, but altered as to the 
directions and the time allowed. New time l i m it is 1| minutes. 

Directions: ‘Dook at these divided boxes or squares (pointing to key); 
notice that each has a number on the upper part and a little mark on the 
lower part. Also, that with every number there is a different mark. Now, 
look here (pointing to sample) where the boxes have only numbers, and the 
squares underneath are empty, I want you to put in each of these squares 
(pointing) the little mark that should go there, like thisJ"^ (The examiner 
points alternately to key and to the digit). ^‘Here is a 2, so you would 
put in this mark” (writing in symbol.) ‘‘Here is a 1, so you put in this 
mark. This is a 3, therefore, you would put in this mark,” etc. After 
doing of the samples, pause and ask; “Now what should I put here?” 
(indicating the next empty square.) If subject fails, examiner tells 
him and repeats question with next sample. After finishing the demon- 
stration, examiner says; “Now you begin here and fill as many squares 
as you can before I call time” / 

Scoring: Subject’s score is the total number of symbols correctly 
entered. Precision and neatness are disregarded, but recorded sym- 
bols must be identifiable. For the symbol H, subjects sometimes use 
the letter N, Count such reversals as half right. 

Maximum score: 67. 

VoCABULAEY TeST 

Directions: Examiner says; “I want to see about how many words you 
know. Listen carefully and tell me what these words mean, Apple — 
whot does apple mean? Donkey — what does donkey meant” Then 
proceed with words in listed order, repeating at each presentation the 
formula “What does meant” With more intelligent subjects, 
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the phrase ^^What does meanf” may be omitted after the third 

presentation. The following is the list and order^ of words to be used: 


VOGABXTLAKT 


1, Apple 

12. Cedar 

1^23. Vesper 

34. Hara-kiri 

2. Donkey 

13. Tint 

24. Belfry 

35. Chattel > 

3. Join 

a-^4. Armory 

25. Recede 

36. Dilatory 

4. Diamond 

15. Fable 

26. Affliction 

37. Amanuensis ^ 

5. Nuisance 

16. Brim 

27. Pewter 

38. Proselyte ‘ 

JB. Fur 

17. Guillotine 

28. Ballast 

39. Moiety ' "y 

40. Aseptic 

7. Cushion 

18. Plural 

29. Catacomb ^ 

8. Shilling 

19. Seclude 

30. Spangle 

41. Flout 

9. Gamble 

1/20. Nitroglycerine 

31. Espionage 

42. Traduce 

10. Bacon 

21. Stanza 

32. Imminent 


11. Nail 

22. Microscope 

33. Mantis 



Scoring: Scoring of definitions should be made with the aid of criteria 
given below. The general rule is that any recognized meaning of the 
word is acceptable, elegance of language and precision being disregarded. 
With this rule observed, most of the words on our list can be scored 
without great difficulty. Occasionally, however, one is, hard put to 
decide whether the subject does or does not know the meaning of a 
word. In such cases it is always permissible, and indeed imperative, 
to say: ^^Please explain a little moreJ^ While elegance of expression is 
disregarded, poverty of content is penalized to some extent. When a 
subject seems to know only vaguely what a word means his response is 
credited only as half right. Thus, f credit is given for such responses 
as “Apple^’ — “It grows on trees.’’ ^^Diamond” = “A diamond 
ring.” “Nuisance” = “A bad boy.” 

Except for very sujjerior subjects, it will seldom be necessary to go 
through the entire list. The general rule is to continue down the list 
until 5® successive words have received 0 credit and to count the remain- 
ing ones as failures. A subject’s final score is the sum of his partial 
(i) and full credits (+). 

It is generally desirable for the examiner to take down the subject’s 
responses verbatim. This not only permits rechecking the scoring, 
but, what is particularly important in mental cases, a qualitative 
evaluation of the subject’s thought processes. 

Maximum score: 4^, 

® The war and other factors have affected the difficulty of some of the words. 
Thus, 'nitroglycerine^, ‘espionage’ and ‘hara-kiri’ are much easier than originally 
estimated; on the other hand, ‘seclude’ seems to have been misplaced from the 
start and should be farther down the list. The original listing of the words, 
however, has not been altered, and the necessary changes in order of presentation 
are left to the examiner. 

® See footnote 5. 
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GENERAL REMAREB ON RECORDING AND SCORING OF 

TESTS 

The examiner will find the recording and scoring of tests greatly 
expedited if he uses the standard blank provided for this purpose. This 
blank consists of coded item outlines for the tests, with spaces at the 
right for responses and for time notations where the latter are required. 
Most of the subject’s responses or notations may be entered directly 
in the space next to each item, but in the case of the Similarities and 
the Comprehension Tests where verbatim recording is required, the sub- 
ject’s answers should be written out in full on the reverse side of the 
record blank. The advantage of this procedure is that it permits 
subsequent evaluation of responses and also the perusal of them by 
others who may want to go over them. 

The examiner should not try to do the arithmetical work incidental 
to recording, or try to consult the tables, etc., while examining the 
subject. 

Where a time record is required, the time counted is that which 
elapses from the moment the examiner has finished his instructions, to 
the moment that the subject indicates that he has completed a test. 

OBTAINING A SUBJECT’S WEIGHTED AND TOTAL SCORE 

To obtain a subject’s total weighted-score on any of the Bellevue 
Scales, proceed as follows; (1) Score the individual items of each test 
separately as indicated in the manual of directions. (2) Add the par- 
tial credits allowed for each item into a single score. These sums are 
the raw scores for each test. (3) Enter the raw scores attained by the 
subject on each test in the summary box of the record sheet in the 
column marked RS. (4) Look up the weighted scores for eaoh raw 
score in the table of equivalent weighted scores (table 36). This is 
done by first locating in the table, the column of the particular test that 
is being considered, finding the row in which that score occurs and 
running one’s finger to the extreme left or right of the column marked 
weighted score. The score thus found is entered in the col um n WT 
in the summary box on the line reserved for that test. It is these 
weighted scores which are used in obtaining M.A.’s, I.Q.’s and other 
ratings. (5) Tlie tests have been divided into two groups, one called 
Verbal and the other Performance, First add these separately to 
form the respective sub-totals for each group, then add the sub-totals 
together for a grand total, which is the score to be used for the subject’s 
ratings on the Full Scale, 
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The examiner should remember it is always the weighted or sigma 
scores that are used for the various ratings as given in our tables. The 
use of the I.Q. Tables is explained below. I.Q.’s may be obtained for 
Performance, Verbal or Full Scales, but ordinarily only the Full Scale 
I.Q. should be used in defining a subject’s general intelligence level. 

TABLE 36 


Table of weighted scores 


EQUIVALENT WEIGHTED 
SCOHS 

EAW SCORE 

EQUIVALENT WEIGHTED 
BCORB 

Information 

Comprehension 

Digit Span 

Arithmetic 

Similarities 

Vocabulary 

Picture Arrange- 
ment 

Picture Comple- 
tion 

Block Design 

Object Assembly 

Digit Symbol 

0 


0 

5 

1 


0 


2 


1 5-6 

0-3 

0 

1 

0 

1 


2 

0 

1-2 

I 0 

3 

0 

1 7 

4-7 

I 1 

2 

1 

2 

6 


1-2 

3-4 

1 1 

4 

1-2 

1 S 

8-11 

2 

3 

2-3 

3 

7 

3 

3 

5-6 

1 2-3 

5 ; 

3-5 

1 9 

12-15 

3 

4 

4-5 

4 

8 

4 

4 

7-9 

! 4 

6 ' 

6-7 

110-11 

16-19 

4 

5 

6 ' 

5-6 



5-6 

10-11 

5 

1 

8-10 

1 

20-23 

5 

6 

7-8 

7 

9 

5 

7 

12-14 

6 

7 * 

11-12 

13 

24-28 

6 

7 

9 

8 

10 

6 

8 

15-16 

7-8 

8 

13-15 

14-15 

29-32 

7 

8 

10-11 

9 



9-10 

17-19 

9 * 

9 

16-17 

16 

33-36 

8 

9 

12 

10 

11 

7 

11 

20-21 

10 

10. 

18-19 

17 

37-40 

9 

10 

13-14 

11 

12 

8 

12 

22-24 

11 

11 

20-22 

18 

41-44 

10 

11 

15-16 

12-13 

13 


13-14 

25-26 

12-13 


23-24 

19 

45-48 

11 

12 

17 

14 1 


9 

15 

27-28 

14 

12 1 

25-27 

20-21 

49-52 

12 

13 - 

is=a9 

15 

14 

10 : 

16 

29-31 

15 

13 

28-29 

22 

53-56 

13 

14 

20 

16 

15 

1 

17-181 

32-351 

16-17 

14 

30-32 

23 

57-61 

14 

15 

21-22 

17 


11 

19 

35-36 

18 

15 

33-34 

24 

62-65 

15 

16 

23 

18' 

16 

12 

20 

37-38 

19 


35-37 

25 

66-67i 

16 

17 

24 

19 

17 

13 

21-22 

39-40: 

20 and 


38 and 

26 


17 

18 



25 

20 


14 

23-24 

41-42 

over 


over 
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FINDING A SUBJECT’S I.Q. 

METHOD OF DBEIVATION *" 

The I.Q.’s in the appended tables have been calculated by dividing, 
at each age, the sigma score of the individual by the mean sigma score 
for his age group, with the zero point at different ages so adjusted as to 
mg.VR the I.Q.’s of all ages strictly comparable. 
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I.Q.’S FOR THE FULL SCALE-ADULTS, AGES 16 TO 60 

All I.Q/s are obtained from total scores. For the Full Scale this 
means the sum of 10 weighted test scores. If for some reason any 
test has been omitted, the incomplete score is brought up to a ten test 
equivalent by prorating. For example, if a subject has been given only 
9 tests, the total score on which is 72, the examiner prorates by taking 
10/9th of 72 and uses the resulting figure (80) as the subject's Full 
Scale score. Having obtained the subject’s total weighted score, the 
examiner turns to the table marked Full Scale. Here he looks for the 
score total in that part of the table in which the subject’s age group is 
listed. He does this by running his finger across from the column in 
which the weighted scores appear to the column listing the subject’s 
age. The number given in this column is the subject’s I.Q. Thus, 
if a subject aged 27, makes a weighted score of 75 on the Full Scale, 
one finds the number 75 in the column marked Weighted Score (page 
237), runs his finger across to the column marked Age 25 to 29 and notes 
the figure there given. In this case it happens to be 86, and this ac- 
cordingly is the subject’s I.Q. 

PERFORMANCE AND VERBAL I.Q.’S 

These I.Q.’s are obtained in the same way as the Full Scale I.Q.’s 
except that the total weighted score represents the sum of the 5 Per- 
formance or the 5 Verbal Tests respectively. The I.Q.’s for each scale 
are given separately. One must be careful when looking up any given 
score to look for it imder the proper table. 

I.Q.’s FOR CHILDREN, AGES 10 — 15 

While the BeUevue Examination is intended piimarily for the ex- 
amination of adults and adolescents, it was standardized from the age 
of 7 upwards. The I.Q. tables, however, have not been constructed 
for ages below 10, because it seemed unlikely that the tests would be 
used to any considerable extent below this age; at least, as organ- 
ized at pr^ent. Looking up I.Q.’s from ages 10 — 15 is done in the 
same ,way as in the case of adults, except that one must take into 
consideration the months as well as the years of the subject’s chrono- 
logical age. For the age period 10 years 0 months to 14 years 6 months, 
the I.Q.’s have been calculated at successive three month intervals; 
from age 14 years 6 months to 15 years 6 months, only at every 6 month 
interval. In obtaining a child’s I.Q, one looks for it imder the year — 
month interval most proximate to his actual chronological age. Thus, 



190 


MEASUREMENT OF ADULT INTELLIGENCE 


if a child is 10 years and 4 months, one should look for it under the 
interval 10 — ^3, if 13 years and 11 months, under the interval 14 — 0, etc. 

INTERPRETATION AND MEANING OF I.Q.'S 

The interpretation of an I.Q, or the mental classification assigned to 
it necessarily varies not only with the test used but also with the as- 
sumptions, statistical and otherwise, made by the classifier. If, how- 
ever, the per cent of population included in each category is definitely 


TABLE 37 

Intelligence classifications 


LQ. 

CLASaiFICATJON 

LIMITS IN TEEMS OF P.E. 

PER CENT 
INCLUDED 

65 and below 

Mental Defective 

—3 P.E.* and over j 

2.2* 

66-79 

Borderline 

-2 P.E. to -3 P.E. 

6.7 

80-90 

Dull Normal | 

-1 P.E. to -2 P.E. 

16.1 

91-110 

Average 

-1 P.E. to +1 P.E. 

50,0 

111-119 

Bright Normal 

+1 P.E. toH-2 P.E. 

16.1 

120-127 

Superior 

+2 P.E. to +3 P.E. 

6.7 

1^ and over 

Very Superior 

-}-3 P.E. and over 

2.2 


•^Setting — 3 P.E. as the upi)€r limit of the mentally defective group reduces 
the expected incidence of Mental Defectives in the total population to 2.2 per 
cent. We are not sure but that this figure is a little low, and that 3 i>er cent 
is a better estimate of the ' probable^' incidence for the country as a whole, 
and even for our own standardizing population. . If it should turn out to be 
that 3 per cent does more nearly represent the ^'trueP^ figure, then the P.E. 
demarcation of the upper limit of mental deficiency would have to be set 
at — 2.8 P.E. from the mean. In that case the I.Q. limits in our table would 
have to be altered ae follows: Menial Defective: I.Q. limits — 68 and below 

Borderline: I.Q. limits— 69-79- 

iudicated, it is possible to compare the relative standings of individuals 
whatever the basis used. The table of classifications that is offered 
here is based on the assumption that class limits are best defined sta- 
tistically. Thus, Mental Defectives are those who (with respect to 
intelligence) fall in the lowest 3 per cent of the population; Borderline, 
those who fall in the next 7 per cent; DuU-Normals, the 15 per cent 
following; Average, those included in the middle 50 per cent of the 
population, etc., etc. Th^e limits have been chosen for the reason 
explained in chapter 4. 

While I.Q limits given in the table will ordinarily determine the 
classification of the individual, there wiU arise from time to time cases 
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which need special consideration. At borderline points the examiner 
will necessarily have to use his own judgment. Thus, while an I.Q. 
of 65 is given on our table as the upper limit of mental deficiency, it is 
possible that there will be cases attaining I.Q.’s of 69 or 70 whom the 
examiner will more justifiably want to class as defective. So also, 
will there be individuals who attain I.Q.’s of 64 or 65 on the Bellevue 
who might be better placed in the Borderline group. But the examiner 
may not deviate from these limits to any considerable extent or use 
another scheme of interpreting these same I.Q.’s, for example, that 
of Terman or Kuhlmann. If the examiner believes for some reason 
that an I.Q. obtained on the Bellevue scale does not adequately repre- 
sent the intelligence level of the subject, as defined by the table, the 
only conclusion that he may legitimately draw is that in this case the 
tests are not suitable for measuring the subject’s intelligence. He may 
accordingly either reject the results, or use them only in a qualitative 
way, but having accepted them he is not privileged to disregard the 
I.Q. interpretation as given. 

CRITERIA FOR SCORING VARIOUS TESTS 

GENERAL COMPREHENSION TEST 

1. Envelope. General: Mail it. 

Samples of 
2 credits 

Put it in mail box 
Drop-in nearest box 

(In rural regions also credit.) Give to postman or bring to Post Office 
1 credit 

Run to post office and mail it 
Take it to the police or post office 
Give to the mail man (see above) 

If it is nearby I'd bring it to man to whom it is addressed, if far away, 
I'd put it in the mail box 
Take it to lost and found 
0 credit 

Take to owner 
Open it 

Take to person to whom it is addressed 
Send it back to whoever sent it 
Bring it back if you know the address 
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2. Theatre. General: Notify responsible person (prevent panic). 
Samples of 

.. ^ credits 

Keep quiet and notify the manager immediately 

Tell the manager about it 

CaU it to the attention of one of the ushers 

Tell orchestra to play — and notify manager 

Quietly call the usher and have h im give the general alarm 

Tell the man who shows you to your seat 

1 credit 

Give alarm quietly 

Ring the fire alarm 

CaU the fire engines, or firemen 

Investigate if fire is real and, if so, notify usher 

0 credit 

Warn the other i>eople 
Scream there is a fire 
Run out 

Walk to nearest exit 

TeU people there is a fixe and to use the nearest exit 
Go for water 

Try to get friends out as quickly as possible 

3. Bad Compa^tt. General: Influence. 

Samples of 

2 credits 

They teach you to be bad 

So that you wouldn^t be bad yourself 

Influence our behavior — our thoughts 

We degenerate ourselves 

Keep from being spoiled by them 

Usually foUow in their footsteps 

Degrade us 

Aflect our morals 

Become like them 

1 credit 

To keep your reputation 
WiU get you into trouble ^ 

Keep ourselves out of trouble 
FinaUy become criminal yourself 
Teaches you to be a crook 

Because if you hang around you have to be like them 
Because I don^t want to be bad 
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0 credit 

'Cause they don't talk what good children should hear 

'Cause don't learn anything good 

They give trouble 

They are not good for anybody 

Not beneficial 

To live honest 

So we wouldn't be picked up 


4. Taxes. General: To support the government. 

Samples of 
2 credits 

To support government 

Upkeep of the community in which they live 

So government should keep going 

To pay government bills 

To run country (explained) 

1 credit 

Eor upkeep of institutions of all kinds 

To help with the war effort 

Pay police, firemen, public officials 

Pay for the army and navy and the things they need. 

To support schools and parks 

Support for public works 

So city can get things done 

Pay for public organizations of government 

For Home Relief and hospitals^ 

0 credit 

For unemployed (poor people)^ 

Because the President has to pay off people 
So that other people could make more money 
Have government do things that are expyensive 
They have property protected; must pay for that 
Give relief (alone)^ 

Higher taxes, better service 
Help city and city help you 
For the city 

N.B, : The answer, 'Tor Home RelieP^, etc., is given no credit if stated as sole 
answer, but credited 1 if accompanied by mention of another governmental func- 
tion, as "for Home Relief and hospitals.^^ If subject mentions "Home Relief' 
alone, however, the Examiner may not say "and what else?" Similarly, 0 score 
is given for any other singly mentioned governmental function, e.g. "For 
police," or “For schools." 
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5. Leather (Shoes). General: Durable, flexible, comfortable, economical. 

2 credits if subject mentions at least 3 different qucdUies 

(1) DwroMlity: Take a lot of wear, strong, wears well, lasts, durable, 

stands elements 

(2) Flexibility: Pliable, soft, molds well, shapes well, easy to wear, 

because of softness 

(3) Comfort: Not hard on feet, easy to wear, light, comfortable, 

porous 

(4) Economic factor: Good supply, have a lot of, easy to get, cheap, 

not too expensive 

1 credit if subject mentions 1 or 3 different qualities 

0 credit if subject mentions no essential quality, thus; 

Most substantial product it can be made of 
Tradition 

To look dressed up 

Shock absorbers 

Most convenient 

Protect feet 

Can be fixed 

Leather does not bum 

Best wearing material (unexplained) 

6. City Land, General: Greater demand. 

Samples of 

B credits 

More demand for land in city 

Being used more, more people want the location 

More people want to buy it 

Supply and demand — more demand in crowded city 

Harder to get in the city (explained as demand) 

Many people after it (if explained satisfactorily) 

1 credit 

City has more people 
Limited amount of land 
Scarce in city 

Can profit more from land in city 

More conveniences, more stores and good transportation® 

Business possibilities 

Commercial reasons — one can erect buildings which would bring in 
more than in country 


« A credit of 1 is given if 2 or more ''conveniences'* are mentioned. When only 
one is mentioned, no credit is allowed. 
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0 credit 

Buildings axe larger in city, they cost more 
There are houses on land in city, house worth something 
Because hardly anyone wants to live in the country 
In city there are more things — ^taxes, congestion 
Because people have more chance to get work 

7. Fokest. General: Use of natural fact as means of finding direction. 
Samples of 

2 credits 

By the sun (explained)® 

By noting what side moss grows on trees and getting my direction 
accordingly 

1 credit 

By the sun— (unexplained) Moss — (unexplained) 

Climb to top of tallest tree and try to locate landmark 
Follow a path 
Follow a stream 

Go in one direction — ^mark trees as I went 
Try to set a fixe to cause a lot of smoke 
Look for landmarks for bearings 
0 credit 

Go back the way I came 

Ask somebody — or a pohceman 

Walk until I get somewheres 

I would shout 

Look for my foot tracks 

Look at the sky and foUow the open spaces 

Keep walking imtil I meet someone 

Wait for night to come and direct my^lf by the North Star 

8. Laws. General: Social regulation. 

2 credits 

In order to regulate society 

To preserve life, liberty and keep people living together harmoniously 

For safeguarding lives and property of individuals 

To make human associations possible without friction 

life so complex that laws are necessary to live amicably 

Where there is any group, there are rules necessary for the good of all 

»J[t is important to make sure that the subject actually knows how the sim 
would help in obtaining direction. If there is any doubt, ask how wotdd it 

help youP^ 




196 


MEASUREMENT OF ADULT INTELLIGENCE 


1 credit 

PubKc good, public welfare 

Regulations to keep people in check, to make them respect other 
people^s rights 

To govern people — ^to keep them in control 

Keep people from committing crimes 

Keep up the standards of civilization 

Otherwise people would act in confusion 

For an organized community, otherwise it would be chaos 

Insures fair play for aU 

Protects the good people from the bad people 

To keep peace and order 

To protect the people 

To make people realize what is right and wrong 
Puts people in their place 

If there were no laws, people would do what they like 

0 credit 

Keep people from getting killed 

If it were not for the laws everyone would kill himself 

To make people obey — ^they make me obey 

To prevent accidents 

So that no one would get in trouble 

To keep up decency 

To keep us from making mistakes 

Otherwise would do wrong things 

So that people will do what is right 

Curb thieves 

9. Maeriage. General: As useful public record. 

Samples of 

2 credits 
Records (For) 

For traceable record 

As record to show they are legally married 
So that they could keep track of marriages 

1 credit 

Prevent bigamy (explained) 

For census purposes 

Proof that you Ye married in case of l^al matters 
0 credit 
A formality 

To protect morals of community; to protect women and children 
So people will know they are married 
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Makes it legal 

Gives offspring a name 

Law demands it 

Protect honor of womanhood 

To keep self respect and not have a common law noarriage 
Otherwise people would be living like animals 
If wife is in trouble she can go to court 

10. Deaf — Talk. General: One must first hear spoken word before being 
able to repeat it. 

Samples of 
2 credits 

Because you learn to talk by hearing 

If you can^t hear language you can^t learn to articulate sounds 
If you don’t hear soimds they make, you don’t know how to make the 
sound afterwards 

1 credit 

Can’t imagine what words are like 
Because he can’t hear what is said 
Do not know how other people talk 
Can’t talk because he can’t hear^° 

0 credit 

Can’t hear themselves talk 
Ear and throat connected 
Along with deafness comes dumbness 
Something is wrong with their nerves 
Because they were bom that way 

Criteria for alternate questions 

1. Save Monet. General: To have something for “a rainy day”, or for 
emergency or future use. 

2 credits 

For a rainy day 

So they’ll have it when they’ll need it 
In case of emergency 
In case of need in bad times 
Something to depend on — ^for security 


This answer cannot be taken at face value. Many subjects who give it 
haven’t the slightest idea of the connection. Examiner should then ask “Hoir 
do you meanV’ If no satisfactory explanation is received, give zero credit. On 
the other hand, if the subject improves his response by saying, '^because you 
can’t learn,” he is given full credit. 
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1 credit 

In case of accidents 

To have money to support father or mother, or for a funeral 

To have something when you get too old to work 

In case of sickness 

You can never tell what happens 

In case you lose your job 

For college education 

If they want to buy a house or a car 

So they can have luxury (if explained) 

0 credit 

For their own interests 
Save money for yourself 

If they^ll lose money they^U turn out to be beggars 

If they spend it aU they won’t have money to buy food with 

Use it for food to live on 

2. Feiend. General: Try to procure it. 

^ credits 

Do my best to get it 

I’d try to get it if it is possible 

Borrow it 

TeU him I don’t have it and try to find someone who does 

1 credit 

Refer him to person who has it 

Ten him I haven’t it, suggest where he can get it 

^'Sorry haven’t it.” Refer him to where he can get it 

0 credit 

Say “Sorry I haven’t got it.” 

Explain I’d like to give it but haven’t got it 
Refuse him 

If I didn’t have it couldn’t give it to him 

CRITERIA FOR SIMILAiUTIES 

General scoring: 

2 credits for the most fundamental likeness 

1 credit for any essential likeness 

0 for failure to respond, giving a difference, or mentioning some in- 
consequential similarity 
1. Orakoe — ^Banana 

Score 2 (Both) fruits 
(Both) foods 

Score 1 (Both) have peels 
You eat both 
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They both grow 
Both contain vitamines 
Score 0 Both contain the same calories 
They are both yellow 
Both same color 
Both same shape 

2. Coat — Dress 

Score 2 (Both) garments 

(Both) wearing apparel 
(Both) articles of clothing 
Score 1 You wear both 
Cover the body 
(Both) made from cloth 
Keep you warm 
Things to put on 

Score 0 Both made of the same material 

You put the coat on top of the dress 
Both have buttons 


3. Dog — Lion 

Score 2 (Both) animals 
(Both) mammals 

Score 1 (Both) have legs (or furs; eyes; tails, etc.) 

(Both) belong to the same species 
Score 0 Lion growls and dog barks 

4. Wagon — ^Bicycle 

Score 2 (Both) vehicles 

(Both) conveyances 
(Both) means of transportation 
Score 1 (Both) movable toys . 

You ride on them 
They both have wheels 
Can carry things around 
Children play with them 
They both roll — or both ride 
Score 0 Can push both 

5* Daily Paper — ^Radio 

Score 2 (Both) sources of information 
Modes of communication 
Can convey thoughts through them 
Score 1 They give you the news 



200 


MEASUREMENT OF ADULT INTELLIGENCE 


(Both) tell stories 
(Both) means of amusement 
(Both) teach you something 
(Both) used for advertising 

Score 0 You get music from the radio but not from the newspaper 
6. Aie — ^Water 

Score 2 (Both) necessary for life 

Have to have air to live and must have water to live 
(Both) necessary for functioning of human body 
CanT live without them 
Score 1 (Both) elements 

(Both) have oxygen 
(Both) means of transportation 
(Both) help us in living 
You need both (explained) 

Score 0 Nature^s gifts 

Oxygen is in the air and hydrogen is in the air and water is HgO 

For breathing, for fish, and human beings 

Breathe the air and drink water 

Water is a substance and air is a substance 

Keeps on going; so does water keep on going 

Both in clouds 

Need both (unexplained) 


7. Wood — Alcohol 

Score 2 (Both) organic substances 

(Both) are hydro-carbons (or have carbon in them) 

Score 1 (Both) bum 

(Both) used as fuel 

(Both) used in the manufacturing of important materials 
Score 0 Both useful 

Alcohol comes from wood 
Both knock you out 


8. Ete— Ear 

Score 2 (Both) sense organs (or senses) 

(Both) receptors of stimuli 
(Both) organs of perception 
Score 1 Parts of the body 
Parts of the head 
You know from them 
Get knowledge through them 
Score 0 Hear with the ear and see with the eye 
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Both have an opening 

Both can tell what you are saying — you read lips and 
hears what you say 
You need both 


9. Egg — Seed 

Score 2 (Both) embryonic substances 
(Both) beginnings of life 
(Both) reproduce what they come from 
Yoimg come from both 

Score 1 (Both) give life (incompletely explained) 

Means of reproduction (if not further explained) 

(Both) can create 
(Both) have shells 
Something grows from both 
Score 0 You can eat both of them 
Both have a yolk 
Both are round 

10. Poem — Stattje 

Score 2 (Both) works of art 

(Both) expression of artistic feeling 
(Both) are artistic creations 
Score 1 (Both) are beautiful things 
(Both) stir feelings 
(Both) symbolic 

(Both) are memorial, make you remember 
(Both) tell stories 
(Both) made by man 
Express an emotion or idea 
Score 0 Both lifeless 

A poem may describe a statue 
You read a poem and look at a statue 
People like them 
Lost a long time, if good 

11. Pbaise — ^Ptjnishment 

Score 2 (Both) means of influencing the behavior of others 
(Both) methods of discipline 
(Both) methods of motivation 
Score 1 (Both) are forms of criticism 

Used on people to put them in a certain emotional state 
Means of getting things done 
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(Both) affect a person (if explained) 
(Both) methods of training a child 


12. Fly— Tree 

Score 2 (Both) have life 

(Both) living things 
Score 1 (Both) breathe 

(Both) created by nature 
(Both) grow 

(Both) need sunlight to live 
(Both) need food 

Score 0 Fly is small, tree is large 
Both useful to man 

Fly has wings and tree has leaves and branches 
Both outdoors aU the time 
Both carry germs 

CRITERIA FOR VOCABULARY TEST 

1, Apple. Dictionary definition: Edible fruit. 

Samples of 

1 credit 
Fruit 

Something to eat, like a peach or a pear 
i Credit 
Red and sweet 
It grows on trees 

2, Donkey. Dictionary definition: An ass, a long eared equine quadruped 

smaller than a horse, 

1 Credit 
Animal 
Jackass 

Beast of burden 
Like a horse, only smaller 
Mule 
§ CredU 
Has long ears 
Ride on him 

3, Join. Dictionary definition: Set or bring together ^ to connect or to combine. 

To lie in contactj to become associated. To engage in together, 

1 CredU 

To bring togetho* 

Join a church or club 
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To register 
To connect 
To combine 
To unite 
To sign up 
i Credit 
To contact 

4. Diamond. Dictionary definition: Gem of great brilliancy. Figure hounded 

by four equal straight sides and having two angles acute and two obtiise. 
Diamond-shaped spot on a playing card, {Baseball) Square space 
enclosed by the lines connecting bases, 

1 Credit 
Gem 

Precious stone 
Diamond spot on card 
Jewel 

Something you get imderground for a ring 
Substance made of carbon 
A baseball diamond 
J Credil 

A (diamond) ring 
It’s valuable 

5. Nuisance. Dictionary definition: That which annoy Sj vexes, or harms, 

1 Credit 
A pest 

Someone who bothers you 
Pain in the neck 
Person who gets in the way 
} Credit 
A bad boy 
Trouble 

6. Fue. Dictionary defirdtion: Soft fine coat covering the skin of many mam- 

mals, Skins of fur-bearing animals, also apparel made from them. 
Any fuzzy covering. 

1 Credit 
A skin 

A coat of animal 

What some animals have on top 

Get from off an animal 

Hair of animals 

They make coats from them 
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i Credit 
To keep wann 
People wear it 
Hairy stuff 
Clothes 

7. Cushion. Dictionary definition: Plau on a cushion. Flexible covering 

filled loith elastic material or air. Any device to deaden jar, 

1 Credit 
A soft seat 
A pillow 

A pad for a seat or sofa 
A shock absorber 
You sit on it 
You lie on it 
i Credit 

To make you comfortable 

8. Shilling. Dictionary definition: A coin of Great Britain, 24^. 

1 Credit 

A foreign coin, (English) money 
It’s worth 25 cents 
i Credit 
A penny 

9. Gamble. Dictionary definition: Squander by gaming; to game; to risk or 

wager something of value upon a chance, 

1 Credit 
Win or lose 
Take a chance 
To bet 
To sj)eculate 
To gamble money 
To play cards or dice 

10. Bacon. Dictionary ddfinition: Scdted, dried or smoked flesh of the hog, 

1 Credit 

Meat from a pig 
(T 3 rpe o^ pork 
} Credit 
Meat 

Something you eat 
Food 
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0 Credit 

Ribs from an animal 

11. Nail. Dictionary definition: Thin horny plaie on end of finger or toe. 

Slender piece of metal need for driving into wood. 

1 Credit 
Spike 
Brad 

To fix wood together 
A piece of metal to hold things together 
Hard skin at the end of the fingers or toes 
Fingernail 
J Credit 

Something to hammer 
Metal 

12. Cedak. Dictionary definition: Tree of the pine family halving fragrant 

wood. 

1 Credit 
An evergreen 
A kind of wood 
A tree 
i Credit 
A cedar chest 
A kind of box 

13. Tint. Dictionary definition: Tinge, hue, a slight color. 

1 Credit 
A dye 

A (blended) color 
To paint 

To brighten a faded fabric 
Add a bit of coloring 

0 Credit 

A texture 

14. Abmobt. Dictionary definition: Depository for arms; place for assembling 

of troops. 

1 Credit 

Where they train to be soldiers 
. Where soldiers and arms are kept 
Military building (explained) 

J Credit 

A soldier's club 
N. G. Headquarters 
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0 Credit 

A training camp 
People stay there 
Fort 

15. Fable. Dictionary definition: Brief to2e with moral, with persons, animals 

or inanimale things as speaker and actor. Legend or myth; an invention, 
fabrication, 

1 Credit 

A fairy tale 

A short story with a moral 
A myth 
Fiction (story) 
i Credit 
To teU a lie 

16. Bbim. Dictionary definition: Margin, edge, border. To fiB or he fiUed 

to the brim, 

1 CredU 

The edge of something (like a hat) 

The rim 
1 Credit 
The end 

0 Credit 

Something wide 
Surface 

17. Guillotinb. Dictionary definition: To behead. Instrument for beheading 

criminals in which a weighted knife is used, 

1 Credit 

Form of execution (beheading) in France 
Used to chop heads off 
I Credit 
A knife 

18. Plural. Dictionary definition: Containing more than one, 

1 CredU 
More than one 
Several 
Not singular 
i CredU 

Change “y^^ to and add ^‘es” 

In grammar, number 
Double of anything 
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0 Credit 

Part of sentence 
Part of a verb 
Comparison 

19. Seclude. Dictionary definition: Remove and keep apart, as from society. 

1 Credit 
Keep apart 
To hide 

To be alone, by yourself 
To isolate 
§ Credil 
To separate 
Out of the way 
A quiet spot 
Lonely 

0 Credit 

In a snug place 

20. Nitroglycerine. Dictionary definition: Light yellow oily explosive liquid 

made by treating glycerine with nitric acid. 

1 Credit 

An explosive 
A chemical 

(a drug) used as heart stimulant 
i Credit 
A medicine 
Used in ammunition 

0 Credit 

A cleaning substance 
A food 
An acid 

21. Stanza. Dictionary definition: Group of rimed lines, forming one of series 

of divisions of a poem. 

1 Credit 

Group of rimed lines 
Part of a poem 
Verse in a poem 
Part of a soAg 

There are so many stanzas in a poem 
J Credit 
A line 
A phrase 
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A few lines to make one thought 
A paragraph 
A poem 

22. Microscope. Dictionary definition: Instrument for assisting the eye to see 

minute objects. 

1 Credit 

Used to magnify small objects 
Instrument that makes small things look large 
i Credit 

An instrument you see germs with 
Something that enlarges 

0 Credit 

Something that brings things closer when you look through them 
Something to look through 

23. Vesper. Dictionary definition: Pertaining to evening or to the service of 

Vesper prayers, or songs; hell that calls to Vespers. 

1 Credit 
Church service 

H 3 rnm or evening prayer 
} Credit 
Twilight 

24. Belfry. Dictionary definition: Tower in which a hell is hung, or the part 

containing the bell. 

1 Credit 

Bell tower in church 
(Steeple) where bell is kept 
J Credit 
Tower 

0 Credit 
Abell 

The ringing of a bell 

25. Recede. Dictionary definition: To move, to tend or incline backward; 

to withdraw; to cede back. 

1 Credit 

To go back 
To withdraw 
i Credit 
To quit 

26. AiTLicnoN. Dictionai^ definition: Distress of body or mind or that which 

causes it; gri^; calamity. 
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1 Ct^dU 

Grief 

Misfortune 
Malady 
i Credit 
A disease 
A deformation 

Something which inconveniences you 
A hurt 

0 Credit 
To harm 

27. Pewter. Dictionary definition: An alloy of tin and lead often used for 

tableware; vessels collectively of pewter, 

1 Credit 
Alloy 

Metal from which dishes are made 
Tableware 
1 Credit 
A metal 

Pottery made of metal 
Lead and something 

0 Credit 
Chinese vase 

A certain type of silver 
Pew in a church 

28. Ballast. Dictionary definition: To provide or fill with ballast; any heavy 

substance as sand, etc., laid in the hold of a vessel to steady it, 

1 Credit 

Weight used on shijjs to steady them 
Added weight 

Gravel or broken stone (for a railroad bed) 
i Credit 
A weight 

To even things off with weight 

29. Catacomb. Dictionary definition: Underground gallery used for burial 

place, 

1 Credit 

Underground cell used in ancient times by Christians 
Underground tunnel with skeleton r emain s 
J Credit 
Cave 
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0 Credit 

The Egyptians used it 

30. Spangle. Dictionary definition: SiYiaH sparkling object; small bit of metal 

foil or brilliant tin for decoration in dress; to adorn with spangles, 

1 Credit 

Glittering substance used for decoration 
Shiny thing on a dress 
J Credit 

A gleaming object 
A piece of jewelry on dress 
An ornament 

0 Credit 

Something dotted 

Something about the star-spangled banner 

31. Espionage. Dictionary definition: Practice of spying; excessive or offensive 

surveillance, 

1 Credit 

Act of spying 
Spy system 
Spy work 
1 Credit 
like a spy 

0 Credit 

like sabotage 

32. Imminent. 'Diciionsxy de&milon: Dangerous and close at hand; impending, 

1 Credit 

Near or going to happen 
Immediate 
Hovering 
i Credit 
Likely 
Probable 
Sure to come 

0 Credit 

Something occurring at the present time 

33. Mantis. Dictionary definition: Insect which assumes a position as of 

prayer when waiting for prey, 

1 Credit 
Insect 

Bug like a g^rasshopper 
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i Credit 

Bug (unexplained) 
0 Credit 
A butterfly 


34. Haraexri. Dictionary definition: A Japanese method of committing 

suicide by ripping open the bowels, 

1 Credit 

Japanese method of committing suicide 
Disemboweling 
i Credit 

Self-strangulation used by Japanese 
Falling on a sword 

0 Credit 

Suicide in China 

35. Chattel. Dictionary defimtion: Article of pcrsoncd property; a movable, 

1 Credit 

Legal term for personal property 
A possession 
J Credit 
A horse 
A slave 

0 Credit 

A (first) mortgage 
Heal property 

36. Dilatort. Dictionary definition: Tending to cause tardiness^ sloumess and 

delay, 

1 Credit 
Postponing 
Making for delay 

Waits a long time before doing something 
} Credit 
Slowness 

Tardiness « 

Holding up 
0 Credit 
Dilate 
Lazy 

37. Amanuensis. Dictionary definition: One who copies manuscript or takes 

dictation. 
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1 Credit 
A scribe 
A secretary 

0 Credit 

A servant 

38. Proselyte. Dictionary definition: To win over to different religion or 

'party; one thus won over, 

1 Credit 
Convert 

i Credit 

A man who changes his religion 
It^s a follower 

0 Credit 

A person connected with religion 
A disciple 

39. Moiety. Dictionary definition: Half; a small part, 

1 Credit 
A half 

A small portion 
A lesser portion 
i Credit 
A quantity 

40. Aseptic. Dictionary definition: Free from disease germs or tendency to 

putrefaction. 

1 Credit 
Without germs 
Sterile 
i Credit 
Clean 

0 Credit 

Something against bacteria 
Non-poisonous 

. * 

41. Flout. Dictionary definition: To scoff at; jeer. A gibe; scoff. 

1 Credit 
Sneer at , 

Ignore authority 

J Credit 
Insult 
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0 Credit 
To flaunt 

42. Teaducb. Dictionary definition: To misrepresent xcilfuUy; defame; slander. 

1 Credit 

Harm person’s character 
Vilify 
i Credit 

To lower (in esteem) 



BIBLIOGRAPHY 

Studies with the Wechsler-Bellevue Scale 

1. Andebson, E. E. et al.: Wilson College studies in psychology. I. A com- 

parison of Wechsler-Bellevue, Binet and ACE at college level. J. Psychol, y 
1942, 14: 317-000. 

2. Bajlinskt, B.: An analysis of the mental factors of various age groups from 

9-60. Genet, Psychol. Monog.y 1941, 23: 191-234. 

3. Balinsky, B., Iskabl, H., and Wechsler, D.: Relative effectiveness of 

the Stanford-Binet and Bellevue Scale, etc. Amer. J. Orthopsychiat., 
1939, 9: 798-801. 

4. Benton, A. L., Welder, A., and Blaijvelt, J.: Performance of adult 

patients on Bellevue and Revised Binet. Psychiat. QtmrLj 1941, 16: 
802-806. 

5. Brody, M. B.: A survey of the results of intelligence tests in psychosis. 

Brit. J. Med. Psych., 1942, 19: 215-257. 

6. Brown, R, R., and Partington, J. E.: The intelligence of the narcotic 

drug addict. J. Genet. Psychol., 1942, 26: 175. 

7. Brown, R. R., and Partington, J. E.: A psychometric comparison of 

narcotic addicts with hospital attendants. J. Genet. Psychol., 1942, 27: 
71. 

8. Fishbein, S. : An evaluation of the Wechsler-Bellevue Intelligence tests . . . 

on the College Level, 1941. Master’s Thesis, Temple University. 

9. Goidparb, W,: An investigation of reaction time in older adults, T. C. 

Contributions to Education, No. 831. New York, 1941. 

10. Halpern, F-: Comparison of Revised Stanford with Bellevue Adult, etc. 

Psychiat. Quart. SuppL, 1942, 16: 206-211. 

11. BIayes, S. P, : Alternative scales for the mental measurement of the visually 

handicapped. Outlook for the Blind, 1942, 36: 225-000. 

12. Knight, R. P., Gill, M., Lozoff, M., and Rapaport, L).: Comparisons 

of clinical findings and psychological tests in three cases bearing on mili- 
tary selection of personnel. Bull. Menninger Clinic, 1943, 7: 114-128. 

13. Lewinski, R. j.: Performance of Naval Recruits on the Kent Oral Emer- 

gency Test and the verbal battery of the Wechsler-Bellevue Adult Intel- 
ligence Scale. Amer. J. Orthopsychiat., 1943, 13: 138-140. 

14. Lewinski, R. J.: Intertest variability of subnormal Naval Recruits on the 

Bellevue Verbal Scale. J. Abnorm. and Soc. Psychol., 1943, 38: 540-44. 

15. Levi, J.: A psychometric pattern of the adolescent psychopathic person- 

ality. Doctoral Dissertation, New York University, N. Y., 1943. 

16. Machover, S. : Cultural and racial variations in patterns of intellect. T. 

C. Coniributiom to Education, No. 875. New York, 1943. 

214 



BIBLIOGRAPHY 


215 


17. Margaret, A.: Parallels m the behavior of schizophrenics, paretics and 

presenile non-psychotic patients. J. Abnorm. and Soc, Psychol.^ 1942, 
37: 511-528. 

18. Maqaret, a., and Wright, C.: Limitations in the use of test perfor- 

mances to detect mental disturbance. J. Applied Psychol.^ 1943, 27: 
387-398. 

19. Maizlish, I. L.: a comparison of Stanford-Binet & Bellevue with adult 

offenders. Psych, Bull., 1942, 39: 472. 

20. Mitchell, M. B. : Performance of mental hospital patients on the Wechsler- 

Bellevue and the Revised Stanford-Binet, Form L. J. Educ. Psychol, 
1942, 53&-544. 

21. Rabin, A. I.: Test score patterns in schizophrenic and nonpsychotic states, 

J. Psychol, 1941, 12: 91-100. 

22. Rabin, A. I.: Differentiating psychometric patterns in schizophrenia and 

manic depressive psychosis. J, Abn. and Soc, Psychol, 1942, 37; 270- 
272. 

23. Rabin, A. I.: Fluctuations in mental levM of schizophrenic patients. Psy- 

chiat. Quart., 1944, 18: 78-92. 

24. Rabin, A. L: A short form of Ihe Wechsler-Bellevue test. J. Applied 

Psychol, 1943, 27: 320-324. 

25. Reichard, S., and Schafer, R: The clinical significance of scatter on the 

Bellevue Scale. Bidl. Menninger Clinic, 1943, 9: 93-99. 

20. Trist, E. L.: Short tests of low grade intelligence. Ill, Occup. Psych., 
London, 1941, 15: 120. 

27. Wechsler, D.: Effects of alcohol on mental activity. Quart. J. Stud. 

Alcohol, 1942, 2: 479. 

28. Wechler, D., Israel, H., aih) Balinskt, B.: A study of the subtests of 

the Bellevue Intelligence Scale in borderline and mental defective cases. 
Amer. J. Ment Def., 1941, 45: 555-558, 

29. Wechsler, D., Halpern, F., and Jaros, E.: Psychometric study of insulin- 

treated schizophrenics. Psychiat. Quart., 1940, 14: 466. 

30. Weider, a.: Effects of age on the Bellevue Scale in schizophrenic patients. 

Psychicd. Quart., 1943, 17: 337-346. 

31. Weider, A., Levi J., Aim Risen, F.: Performance of problem children on 

the Wechsler-Bellevue Intelligence Scale and the Revised Stanford- 
Binet. Psychiat. Quart., 1943, 17; 695-701. 




APPENDICES 




Appendix 1 

SPECIAL STATISTICAL METHODS 

A. METHOD USED TO CONVERT RAW SCORES INTO WEIGHTED 
(STANDARD) SCORES 

1. Mean and S.D. of raw samplings obtained for each test. 

2. New mean of 10 and S.D. of 3 assigned to each test. 

3. Every raw score for eacb test equated against new weighted (standard) 

score, point by point, by following Hull^s method: 

X, = Ms + ^ (Xi - Ml) 

<ri 

Where, Mj ** assigned mean (10) 

<r 2 ** assigned sigma (3) 

Xj = new score to be fotmd (weighted or standard score) 

Ml *= calculated mean 
<ri *= calculated sigma 
Xi =* raw score 


B. METHOD USED TO CALCULATE INTELLIGENCE QUOTIENTS 

1. Mean and S.D. of weighted score distributions obtained for each age level. 

2. All weighted scores for each age level converted into scores by usual 

formula (z « where M’s and «r’s are the constants of the different age 

level distributions. Charts drawn up for each age level giving the weighted 
scores and their corresponding “z” scores. 

3. ‘‘z” scores for the different age distributions equated against P.E. limits 
such that the score .6745<r (1 P.E.) below the mean would in all age levels give 
an I.Q. of 90. This was carried through as follows: 

(a) For each age level locate the weighted score at .6745<r below the mean 

of the group. 

(b) Calculate zero point for each age level by the formula 


Y + z^ 

y 


LQ. 


(where Y ** zero point, z' =* —.6745 and I.Q.' ** .90). 

Obviously when I.Q.' is set at 90 the zero point then will be equal to 
6.745. 

(c) With the zero point at 6.745, get I.Q. for every weighted score in 
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each age level by (1) looking up charts containing scores and 
their corresponding weighted scores and then (2) solving the formula: 


LQ.. 


6.745 -1- 
6.745 


where I.Q; *=* I.Q. for particular weighted score at any age and 
z; = “z” score for particular weighted score at any age. 


C. EQUATIONS FOR PREDICTING MEAN BELLEVUE TEST SCORES 
FOR ANY AGE— 25 AND OVER 

1. From equation obtained by method of least squares. 

y = ,746z + S1.63 

Where y mean (most representative) score at any age and x =* difference 
between 47.5 and the chronological age (A) at which score is desired. 
Illustration: Required mean score for age 57.5. 

y - .746 (47.5 - 57.5) + 81.63 - 74.2 


2. From regression equation of the two variables, age on test score, 
y -= .735® + 85.77 

where y *» mean (most representative) score at any age, and x =■ difference 
between 40 and the chronological age (A) at which score is desired. 
Illustration: Required mean score at age 70. 

y = .735 (40-70) + 85.77 - 63.7 

N.B.: Equation (1) seems to give better values, but the discrepancies in 
score obtained by the two methods are very slight (seldom more than 1 or 2 
points). 


Conversely, one may use the above equations to calculate the age for which 
any given score is the mean. The case which is of particular interest is the 
age beyond which no further decline is possible. For our adult scale this age 
turns out to be approximately 150 years. This value may be obtained from 
either equation. Thus from equation (1) : 

4 .746 (47.5 - A) -f 81.63 

Setting y =* 4 (lowest possible score). 

Whence, A = 151.5. 

^ Appendix 2 

EFFICIENCY QUOTIENTS 

An individual's Efficiency Quotient is his mental ability score (on the Full 
Bellevue Adult Scale) when compared with the score of the average individual 
20 to 24 years of age. We have chosen the age 20 to 24 as a standard of com- 
parison because it may be r^arded as our maximal age group. It is the age 
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at which education for most adults is pretty much complete, and after which^ 
again for most individuals, intelligence test scores begin to fail off. To get an 
E.Q. one obtains a subject ^s test score in the usual way, but instead of looking 

TABLE 38 


Efficiency Quotients (E.Q.) 
Full Scale 


wsiasTso 

6COHB 

E.Q. 

WSIGBTZD 

SCOBS 

E.Q. 

wsiGsrss 

SCOBS 

E.Q. 

VSIGHTSD 

SCOBS 

E.Q. 

5 

34 

35 

56 

65 

77 

95 

98 

6 

35 

36 

56 

66 

77 

96 

99 

7 

35 

37 

57 

67 

78 

97 

99 

8 

36 

38 

57 

68 

79 

98 

100 

9 

37 

39 

58 

69 

79 

99 

101 

10 

37 

40 

' 59 

70 

80 

100 

101 

11 

38 

41 

59 

71 

81 

101 

102 

12 

39 

42 

60 

72 

81 

102 

103 

13 

40 

43 

61 

73 

82 

103 

104 

14 

40 

44 

62 

74 

83 

104 

104 

15 

41 

45 

62 

75 

84 

105 

105 

16 

42 

46 

63 

76 

84 

106 

106 

17 

42 

47 

64 

77 

85 

107 

106 

18 

43 

48 

64 

78 

86 

108 

107 

19 

44 

49 

65 

79 


109 

108 

20 

44 

50 

66 

80 

87 

110 

109 

21 

45 

51 

67 

81 

88 

111 

109 

22 

46 

52 

67 

82 

89 

112 

110 

23 

47 

53 

68. 

83 

89 

113 

111 

24 

47 

54 

69 

84 

90 

114 

111 

25 

48 

55 

69 

85 

91 

115 

112 

26 

49 

56 

70 

86 

92 

120 

116 

27 

49 

57 

71 

87 

92 

125 

119 

28 

50 

58 

72 


93 

130 

123 

29 

51 

59 

72 

89 

94 

135 

126 

30 

52 

60 

73 

90 

94 

140 

130 

31 


61 

74 

91 

95 

145 

133 

32 

53 

62 

74 

92 

96 

150 

137 

33 

54 

63 

75 

93 

96 

155 

141 

34 

55 

64 

76 

94 

97 

160 

144 


up its positional rating under the subject’s own age, one looks for it under the 
indices given for the age group 20-24. For the conveni^ce of the r^uier 
these indices are now given ai>art and in more compact form. (See table 38.) 
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TABLE 39 
Verbal SvhtesU 


Aas 

N 

rKl'OBUATlOZr 

|colIFBXHXM8XON 

ABXTHU17T1C 

nxQrr span 

BIMH^ABITTSS 

Mean 

8. D . 

Mean 

S . D . 

Mean 

S.D. 

Mean 

S . D . 

Mean 

S.D. 

7 

50 

2.5 

1.11 

2.5 1 

1.54 : 

1.2 

.97 

5.2 

2.52 

3.0 

1.08 

8 

50 

2.9 

1.56 

3.8 

1.94 

1.9 

1.94 

5.7 

2.16 

3.6 

1.55 

9 

50 

3.9 

2.01 

4.9 : 

1.85 

3.6 

2.31 

6.8 

2.5 

5.0 

1.66 

10 

60 i 

5.7 

2.38 

6.5 i 

1.97 

5.3 

2.7 

7.4 

2.14 

6.7 

2.06 

11 

60 1 

7.6 

3.24 I 

7.5 1 

2.34 

7.1 

3.35 

8.5 

2.61 

7.7 

2.49 

12 

60 

9.8 

2.89 

8.6 

2.59 

8.1 

2.71 

7.9 

2.33 

8.8 

2.13 

13 

70 

9.9 

3.13 1 

9.0 

2.80 

8.8 

2.44 

9.1 

2.00 

9.3 

2.00 

14 

70 

10.5 

2.82 

9.6 

2.72 

9.0 

2.78 

8.6 

2.36 

9.6 

2.16 

15 

100 1 

10.6 

2.61 

9.8 

2.93 

8.9 

2.54 

9.0 

2.63 

9.6 

2.13 

16 

100 

10.4 

2.75 

10.0 

2.98 

9.1 

2.72 

8.8 

: 2.63 

9.8 

2.19 

17-19 

100 

10.5 

2.88 

9.9 

3.07 

8.5 

2.67 

9.3 

; 2.74 

9.5 

2.25 

20-24 

120 

10.4 

2.86 

10.0 

2.76 

9.8 

2.60 

10.1 

2.96 

10.2 

2.06 

25-29 

125 

10.1 

2.98 

10.7 

3.10 

9.7 

2.91 

9.9 

3.13 

10.0 

2.40 

30-34 

110 

9.8 

3.12 

9.7 

3.15 

9.2 

3.31 

8.95 

3.41 

9.5 

2.76 

35-39 

100 

9.8 

3.37 

9.8 

3.18 

9.1 

3.08 

8.7 

3.12 

9.4 

2.67 

40-44 

75 

10.1 

1 3.70 

10.0 

3.32 

8.9 

3.84 

9.0 

2.92 

9.7 

2.67 

45-49 

60 

9.5 

; 3.21 

9.5 

3.33 

9.2 

i 2.94 

7.8 

2.52 

9.3 

2.90 

50-54 

45 

9.6 

4.08 

9.5 

3.00 

8.3 

2.85 

7,7 

3.11 

8.8 

2.94 

55-^59 

' 36 

9.5 

3.86 

8.8 

2.87 

6.9 

4.18 

7.5 

3.23 

7.9 

1 

3.38 


MEANS AND STANDARD DEVIATIONS OF SUBTESTS 
TABLE 40 
Performance Svbtests 


AGX 

N 

PTcnrim 

A&BAKGltMXKT 

PICTOBX 

COMPIXTION 

BLOCK nXBlOX 

OBJECT 

ASSEMBLY 

DIGIT 

BYMBOL 

Mean 

S.D. 

Mean 

S.D. 

Mean 

S.D. 

Mean 

S.D. 

Mean 

S.D. 

7 

50 

4.8 

2,48 

3,0 

2.47 

3.8 

2.14 

3.8 

3.26 

3.6 

1.23 

8 

50 

5.9 

2.64 

3.8 

2.31 

4.0 

1.98 

4.3 

3.22 

4.4 

1.31 

9 

50 

6.9 


4.3 

2.46 

5,2 

2.64 

5.4 

3.35 

5.4 

1.78 

10 

60 

8.4 

2.63 

6.4 

2.74 

6.9 

2.84 

7.5 

2.99 

6.6 

1.81 

11 

60 

9.0 

2.68 

6.6 


7.4 

2.42 

8.8 

2.63 

mmm 

1.93 

12 

60 

9.2 

2.91 

7.3 

3.08 

9.1 

2.48 

9.6 

2.96 

9.2 

2.58 

13 

70 

9.3 


8.7 



2.88 

10.2 

3.25 

10.1 

2.17 

14 

70 

10.2 


9.5 

2.78 

10.7 

2.94 

10.0 

2.96 

10.4 

2.14 

15 

100 

10.8 

3.10 

9.6 

2,95 


3.18 

9.8 

3.45 

10.6 

2.42 

16 

100 

10.8 

2.97 

■SO 

2.79 



10.0 

3.40 

10.7 

2.17 

17-19 


10.8 

3.10 

9.5 

2.75 

11.0 

2.88 

10.7 

j 3.36 

11,1 

2.43 

20-24 

120 

10.5 

2.83 

10.3 

2.86 



10.4 

1 2.93 

10.7 

2.79 

25-29 

125 

10.2 

3.17 

9.8 




9.7 

2.74 

10.4 

3.23 

30-34 

110 

9.2 

3.28 

9.6 

3.30 

9.7 

3.30 

1 9.7 

[ 2.86 

9.2 

3,31 

35-39 

100 

8.7 

3.42 

9.0 

3.39 

9.0 

3.61 

8.7 

3.39 

9.2 

3.17 

40-44 

75 

7.9 

3.14 

9.0 

3.05 

8.5 

2.90 

8.9 

3.42 

8.1 

3.26 

45-49 

60 

7.7 

3.21 

8.2 

2.80 

7.9 

3.17 

8.8 

i 3.39 

7.2 

2.61 


45 

7.3 

2.94 

7.9 

3.13 

8.0 

2.91 

8.7 


1 6.8 

3.20 

65-59 

36 

i 6.6 

2.93 

7.4 

3.41 

6.7 

3.55 

8.1 

3.14 

5.9 

2.85 
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I ±.0264 I ±.0297 | ±.0472 | ±.0352 | ±.04197| ±.0436 | ±.0392 | ±,0499 | ±.0409 

Note: All similarities intercorrelations based on data of 160 cases, ages 15-49. The remainder of the intercorrelations are 
based on the adult samplings of ages 20-34—356 cases. 
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TABLE 42 

Iniertest correlations — each test with every other test — standard scores 
Ages 35-49, cases 235 



! 

§ 

tk 

s 

S * 

1 

1 

fs 

o 

S 

A 

i3§ 

s 

J 

ii 

S 

ni 

g 

O 

P 

Comprebension 

.705 

±.022 








Arithmetic 

.594 

±.029 

.534 

±.031 







Digit Span 

.438 

±.036 

.372 

±.034 

am 

IS 






Picture Arrangement 

.477 

±.034 

.451 

±.035 

.459 

±.035 

. .341 
±.039 





Picture Completion 

.492 

±.034 

.465 

±.035 

.420 

±.036 


.482 

±.034 




Object ALSsembly 

.416 

±.036 

.357 

±.038 

.352 

±.039 

.274 

±.041 

.359 

±.038 

.467 

±.034 



Block Designs 

.597 

±.028 

■ 


.416 

±.036 

.365 

±.038 


.606 

±.033 


Digit Symbol 

.563 

±.030 

,516 

±.032 

.552 

±.031 

.523 

.032 

.516 

±.032 

.433 

±.036 

i .377 
±.038 

.613 

±.02S 


TABLE 43 

Correlations — each test vs. total minus the test {standard scores) 
Ages 20-34, 355 cases 


TXST 

r 


xrjL 

SUOasiCAH'S CBITSBIOX 

Information. 

.667 

±.0198 

.669 

1.03 

Linear 

Comprehension 

.661 

±.0201 

.668 

3.30 

Linear 

Digit Span 

.509 

±.0265 

.531 

8.12 

Linear 

Arithmetic 

.625 

±.0218 

.626 

.639 

Linear 

Picture Arrangement. . 

.514 

±.0263 

.548 

12.89 

Non-linear 

Picture Completion 

.605 

±.0226 

.619 

6.11 

Linear 

Object Assembly 

.409 

±.0297 

.435 

7.63 

Linear 

Block Design 

.714 

±.0175 

.721 

3.58 

Linear 

Digit Symbol 

.673 

±.0195 

.696 

11.15 

Ldnear 

Similarities 

.727 

±.0260 

.764 

8,25 

Linear 


Note; Similarities correlation based on data of 150 cases, ages 15-49. 
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TABLE 44 

Correlations — each test vs. total minus the test {standard scores) 


Ages 35-49, 235 cases 


TEST 

r 

P.E. 

KIA 

CBXTBStZOir 

Information 

.705 

dr .0221 

.724 

6.380 

Linear 

Comprehension 

.082 

dr .0237 

.690 

2.585 

Linear 

Digit Span 

.517 

±.0322 

.530 

3.290 

Linear 

Arithmetic 

.671 

±.0242 

.679 

2.600 

Linear 

Picture Arrangement 

.625 

±.0269 

.640 

4.465 

Linear 

Picture Completion 

.604 

±.0279 

.622 

5,190 

Linear 

Object Assembly 

.508 

±.0326 

.562 

13.560 

Non-linear 

Block Design 

,727 

±.0207 

.763 

12.880 

Non-linear 

Digit Symbol 

.697 

±.o:m 

.697 

.000 

Linear 



Appendix 4 

OBTAINING IQ’S FOR AGES BEYOND THOSE 
GIVEN IN TABLES 

Although some 150 individuals between the ages of 60-80 were examined 
as part of our standardization, the number of subjects per 5 year interval for 
ages beyond 60, were generally too small to permit reliable calculation of IQ’s 
for all scores, on the basis of the data at hand. However, IQ’s for individuals 
between 60-80 may be extrapolated without serious error by the use of some 
simple multipliers which take care of the age factor. The multipliers have 
been obtained by plotting the successive IQ differences from ages 35 years and 
upwards, at different score levels, and then determining the rate change* in 
IQ per year. Evaluation of these changes give the following formulations: 

FvU Sccde 

a. When the subject attains a weighted score below 60, subtract 57.5 from 

the subject’s age and multiply the age difference by 0.6; add this 
product to the IQ assigned to the ^weighted score made by the subject 
under age column 55-59 (Full scale IQ tables). The sum is the sub- 
ject’s corrected IQ. 

b. For subjects attaining weighted scores above 60, multiply the age differ- 

ence by 0.4 and proceed as before. 

Verbal Scale 

a. For subjects attaining weighted scores below 40 j subtract 57.5 from the 
subject’s age and multiply the age difference by 0.4. Add this product 
to the IQ assigned to the weighted score by the subject under age 
column 55-59 (Verbal IQ tables). The sum is the subject’s cor- 
rected IQ. 

*The regression of IQ on age for any given score, beyond age 35, is practi- 
cally a straight line. 
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b. For weighted scores 40 and above, multiply the age difference by 0.2 and 
proceed as before. 

Performance Scale 

a. For weighted scores below SS, subtract 47.5 from the subject's age and 

multiply the age difference by 1.0. Add this product to the IQ as- 
signed to the score made by the subject under age column 45-^9 on 
the performance scale (Performance IQ tables). The sum is the 
subject's corrected IQ. 

b. For weighted scores between ^5-49 multiply age difference by 0.8 and 

proceed as before. 

c. For weighted scores between 50-69 multiply the age difference by 0.6. 

d. For weighted scores of 70 and over, multiply the age difference by 0.4. 

Examples: 

1. Subject age 68, attaining Full Scale score of 85. Subtract 57.5 from 68 

(68 — 57.5 =* 10.5). Since subject’s score is above 60, multiply 
difference by 0.4 (0.4 X 10.5 = 4.2). Now look up subject’s score 
(85) under age column 55-59 of Full Scale IQ table; the figure there 
given is 106. Adding 4.2 to 106 one gets 110.2 (or to nearest unit, 
110), which is the subject’s IQ corrected for age. 

2. Subject age 79, attaining score of 34 on Performance Scale. Subtract 

47.5 from 72 (72 — 47.5 « 24.5). Since subject’s score falls betv^een 
25 and 49, multiply difference by 0.8 (0.8 X 24.5 = 19.6). Now 
look up subject’s score (34) under age column 45-49 of Performance 
IQ tables; the figure there given is 93. Adding 19.6 to 93, one gets 
113, which is the subject Performance IQ corrected for age. 

Appendix 5 

TABLE OF APPROXIMATE C.A. (ADULT M.A.) 
DENOMINATORS FOR BINET SCALES TO 
ALLOW FOR AGE FACTOR 



TABLE 45 
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Appendix 6 

EQUIVALENTS OF WECHSLER-BELLEVUE AND 
STANFORD I.Q. RATINGS IN TERMS OF 
COMPARABLE P.E. CAND PER- 
CENTILE) LIMITS 

TABLE 46 


INCLUDED 


BELLEVUE* 

OLD STANFOBD 

NEW STANFOEDf 

PJE. 

Per- 

ceat 


ADULT 

TOTALt 

8-12 

14-18 

~3 & 

2.2 

Mental defec- 
tive 

67-below 

73-below 

70-below 

68-below 

-2 to -3 

6.7 

Borderline 

79-68 

81-74 

79-71 

79-69 

-1 to -2 

16.1 

Dull normal 

90-79- 

91-82 

91-80 

91-81 

lto-1 

50.0 

Average 

110-91 

108-92 

115-92 

115-92 

— 1 to —2 

16.1 

High average 
to superior 

119-111 

109-17 

! 116-125 

116-126 

—2 to -3 

6.7 

Superior 

120-127 

118-127 

126-137 

127-136 

-3 & 

2.2 

Very'superior 
i 

128-^ver 

128-over 

138-over 

137-over 


* See text, page 190. 

t Conversions based on Merrill percentile distributions for Old and Revised 
Stanford. Jour. Educa. Psychol., 1938, 29: 641-651. 
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Appendix 7 
TABLE 47 


Table of weighted scores 


a 

§ 

2 

H 

1 

» 

BAW SCOBI 

Q 

2 

a 

§8 

Information 

Comprehenition 

Digit Span 

Arithmetic 

Similarities 

Vocabulary 

1 
p+> 
p © 

.la 

PL| 

h 

a 

S 

o 

O 

« 

a 13 

5 o 
o'5 

s 

a 

1 

Q 

8 

n 

>> 

2 

S 

1 

1 

o 

Digit Symbol 

0 


0 

5 

1 




2 


5-6 

0-3 

0 

1 

0 

1 


2 

0 

1-2 


3 

0 

7 

4-7 

1 

2 

1 

2 

6 


1-2 

3-4 

1 

4 

1-2 

8 

8-11 

2 

3 

2-3 

3 

7 

3 

3 

5-6 

2-3 

5 

3-5 

9 

12-15 

3 

4 

4-5 

4 

8 

4 

4 

7-9 

4 

6 

6-7 

rogH 

16-19 

4 

5 

6 

5-6 



5-6 

10-11 

5 


8-10 

12 


5 

6 

7-8 

7 

9 

5 

7 

12-14 

6 

7 

11-12 

13 

24-28 

6 

7 

9 

8 

10 

6 

8 

15-16 

7-8 

8 

13-15 

14-15 

29-32 

7 

8 

10-11 

9 



9-10 

17-19 

9 

9 1 

16-17 

16 

33-36 

8 

9 

12 

10 

11 

7 

11 

20-21 


■a 

18-19 

17 

37-40 

9 

10 

13-14 

11 

12 

8 

12 

22-24 

11 

11 


18 

41-44 

10 

11 

15-16 

12-13 

13 


13-14 

25-26 

12-13 


23-24 

19 

45-48 

11 

12 

17 

14 


9 

15 

27-28 

14 

12 

25-27 

20-21 

49-52 

12 

13 

18-19 

15 

14 

10 

16 

29-31 

15 

13 

28-29 

22 

53-56 

13 

14 

20 

16 

15 


17-18 

32-35 

16-17 

14 

30-32 

23 

57-61 

14 

15 

21-22 

17 


11 

19 

35-36 

18 

15 

33-34 

24 

62-65 

15 

16 

23 

18 

16 

12 

20 

37-38 

19 


35-37 

25 

66-67 

16 

17 

! 24 

19 

17 

13 

21-22 

ElED 

20 and 


38 and 

26 


17 

18 

25 

20 


14 

23-24 

41-42 

over 


over 



18 
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INTELLIGENCE QUOTIENTS— AGES 10-0 TO 10-11 


FULLSCALB 


■WEIGHTED 

SCORE 

10—0 

I.Q. 

10—3 

I.Q. 

10-6 

I.Q. 

10-9 

I.Q. 

WEIGHTED 

SCOBS 

10-0 

I.Q. 

10-3 

I.Q. 

10-6 

I.Q. 

10-9 

I.Q. 

5 

64 

51 

49 

47 

65 

108 

105 

102 

99 

6 

55 

52 

50 

48 

66 

109 

106 

103 

100 

7 

55 

53 

51 

49 

67 

no 

107 

104 

101 

8 

57 

54 

52 

50 

68 

111 

108 

105 

102 

9 

58 

55 

53 

51 

69 

112 

109 

106 

103 

10 

59 

56 

54 

52 

70 

113 

no 

107 

104 

11 

59 

56 

54 

52 

71 

114 

111 

108 

105 

12 

60 

57 

55 

53 

72 

115 

112 

109 

106 

13 

61 

58 

56 

54 

73 

116 

113 

no 

107 

14 

62 

59 

57 

55 

74 

11^ 

113 

no 

107 

15 

63 

60 

58 

56 

75 

118 

114 

111 

108 

16 

64 

61 

59 

57 

76 

118 

115 

112 

109 

17 

65 

62 

60 

58 

77 

119 

116 

113 

no 

18 

66 

63 

61 

59 

78 

120 

117 

114 

111 

19 

67 

64 

62 

60 

79 

121 

118 

115 

112 

20 

67 

64 

62 

60 

80 

122 

119 

116 

112 

21 

68 

65 

63 

61 

81 

123 

120 

117 

114 

22 

69 

66 

64 

62 

82 

124 

120 

117 

114 

23 

70 

67 

65 

63 

83 

125 

121 

118 

115 

24 

71 

68 

66 

64 

84 

126 

122 

119 

116 

25 

72 

69 

67 

65 

85 

127 

123 

120 

117 

26 

73 

70 

68 

66 

86 

128 

124 

121 

118 

27 

74 

71 

69 

66 

87 

129 

125 

122 

119 

28 

75 

72 

70 

67 

88 

130 

126 

123 

120 

29 

76 

73 

70 

68 

89 

131 

127 

124 

121 

30 

77 

74 

71 

69 

90 

131 

128 

125 

122 

31 

78 

75 

72 

70 

91 

132 

129 

126 

123 

32 

79 

76 

73 

71 

92 

133 

129 

126 

123 

33 

79 

76 

74 

72 

93 

134 

130 

127 

124 

34 

80 

77 

75 

72 

94 

135 

131 

128 

125 

35 

81 

78 

76 

73 

95 

136 

132 

129 

126 

36 

82 

79 

77 

74 

98 

137 

133 

130 

127 

37 

83 

80 

77 

75 

97 

138 

134 

131 

128 

38 

84 

1 81 

78 

76 

98 

139 

135 

132 

' 129 

39 

85 

! 82 

79 

77 

99 

140 

1 136 

133 

130 

40 

86 

83 

80 

78 

100 

140 

137 

133 

130 

41 

87 

84 

81 

78 

101 

141 

137 

134 

131 

42 

88 

85 

82 

79 

102 

142 

138 

135 

132 

43 

89 

86 

83 

80 

103 

143 

139 

136 

133 

44 

90 

87 

84 

81 

104 

144 

140 

137 

134 

45 

91 

88 

85 

82 

105 

145 

141 

138 

135 

46 

92 

89 

86 

83 

106 

146 

142 

139 

136 

47 

92 

89 

86 

83 

107 

147 

143 

140 

137 

48 

93 

90 

87 

84 

106 

148 

144 

141 

138 

49 

94 

91 

88 

85 

109 

149 

145 

142 

139 

50 . 

95 

92 

89 

1 86 

110 

149 

145 

142 

139 

51 

96 

93 

90 

! 87 

111 

150 

146 

143 

140 

52 

97 

94 

91 

88 

112 

151 

147 

144 

141 

53 

98 

95 

92 

89 

113 

152 

148 

145 

142 

54 

99 

96 

93 

90 

114 

153 

149 

146 

143 

55 

100 

97 

94 

91 

115 

154 

150 

147 

144 

56 

100 

97 

94 

91 

120 

158 

154 

151 

14$ 

57 

101 

98 

95 

92 

125 

163 

159 

156 

153 

58 

102 

99 

96 

93 

130 

168 

164 

160 

157 

59 

103 

100 

97 

94 

135 

172 

168 

165 

162 

60 

104 

101 

98 

95 

140 

177 

173 

189 

166 

61 

105 

102 

99 

06 

145 

181 

177 

173 

170 

62 

106 

103 

100 

97 

150 

186 

182 

178 

175 

63 

107 

104 

101 

98 

155 

190 

186 

182 

179 

64 

108 

105 

102 

99 

160 

195 

191 

187 

184 
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MEASUREMENT OF ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES 11—0 TO 11—11 


TXJJJU SCAliB 


WEIGHTED 

SCORE 

11-^ 

l.Q. 

11—3 

I.Q. 

11—6 

I.Q. 

11—9 

I.Q. 

WEIGHTED 

SCORE 

11—0 

I.Q. 

11—3 

I.Q. 

11—8 

I.Q. 

11—9 

I.Q. 

5 

45 

43 

42 

40 

65 

97 

65 

93 

91 

6 

46 

44 

43 

41 

66 

98 

96 

94 

92 

7 

47 

45 

43 

41 

67 

99 

97 

95 

93 

8 

48 

46 

44 

42 

68 

100 

98 

96 

94 

9 

49 

47 

45 

43 

69 

101 

98 

96 

94 

10 

50 

48 

46 

44 

70 

102 

99 

97 

95 

11 

50 

48 

47 

45 

71 

103 

100 

98 

96 

12 

51 

49 

47 

45 

72 

104 

101 

99 

97 

13 

52 

50 

48 

46 

73 

105 

102 

100 

98 

14 

53 

51 

49 

47 

74 

105 

103 

101 

99 

15 

64 

52 

50 

48 

75 

106 

104 

102 

100 

16 

55 

53 

51 

49 

76 

107 

104 

102 

100 

17 

56 

54 

52 

60 

77 

108 

105 

103 

101 

18 

57 

65 

53 i 

61 

78 

109 

106 

104 

102 

19 

58 

56 

54 

52 

79 

110 

107 

105 

103 

20 

58 

56 

54 

52 

80 

111 

108 

106 

104 

21 

59 

57 

55 

53 

81 

112 

109 

107 

105 

22 

60 

58 

56 

54 

82 

112 

no 

108 

106 

23 

61 

59 

57 

65 

83 

113 

no 

108 

106 

24 

62 

60 

58 

66 

84 

114 

111 

109 

107 

25 

63 

61 

59 

57 

85 

115 

112 

no 

108 

26 

64 

62 

60 

58 

86 

116 

113 

111 

109 

27 

64 

62 

60 

58 

87 

117 

114 

112 

no 

28 

65 

63 

61 

59 

88 

118 

115 

113 

no 

29 

66 

64 

62 

60 

89 

119 

116 

114 

111 

30 

67 

65 

I 63 

61 

90 

120 

117 

115 

112 

31 

68 

66 

64 

62 

91 

121 

118 

115 

113 

32 

69 

67 

i 65 

1 63 

92 

121 

118 

116 

114 

33 

70 

68 

66 

64 

93 

123 

119 

117 

114 

34 

70 

68 

66 

64 

94 

123 

120 

118 

115 

35 

71 

69 

67 

65 

95 

124 

121 

119 

116 

36 

72 

70 

68 

66 

96 

125 

122 

120 

117 

37 

73 

71 

69 

67 

97 

126 

122 

120 

118 

38 

74 

72 

70 

68 

98 

126 

123 

121 

119 

39 

75 

73 

71 

69 

99 

127 

124 

122 

120 

40 

76 

74 

72 

70 

100 

128 

125 

123 

120 

41 

76 

74 

72 

70 

101 

129 

126 

124 

121 

42 

77 

75 

73 

71 

102 

130 

1 127 

125 

122 

43 

78 

76 

74 

72 

103 

131 

128 

126 

123 

44 

79 

77 

75 

73 

104 

131 

128 

126 

124 

45 

80 

78 

76 

74 

105 

132 

129 

127 

125 

46 

81 

79 

77 

75 

106 

133 

130 

128 

125 

47 

81 

79 

77 

75 

107 

1 134 

131 

129 

126 

48 

82 

SO 

78 

76 

108 

135 

132 

130 

127 

49 

83 

81 

79 

77 

109 

136 

133 

131 

128 

50 

84 

82 

80 

78 

no 

137 

134 

132 

129 

51 

85 

83 

81 

79 

111 

137 

134 

132 

130 

52 

86 

84 

82 

80 

112 

138 

135 

133 

130 

53 

87 

85 

83 

81 

113 

139 

136 

134 

131 

54 

88 

86 

84 

82 

114 

140 

137 

135 

132 

55 

89 

86 

84 

82 

115 

141 

138 

136 

133 

56 

89 

87 

85 

83 

120 

145 

142 

140 

137 

57 

90 

88 

86 

84 

126 

150 

147 

145 

142 

58 

91 

89 

87 

85 

130 

154 

151 

149 

146 

59 

92 

90 

88 

86 

135 

159 

156 

153 

150 

60 

93 

91 

89 

87 

140 

163 

160 

157 

154 

61 

94 

92 

90 

88 

145 

167 

164 

162 

159 

62 

95 

92 

90 

88 

150 

172 

i 169 

166 

163 

63 

96 

93 

91 

89 

155 

176 

173 

170 

1 167 

64 

97 

94 

92 

90 

160 

181 

178 

1 

176 

172 
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INTELLIGENCE QUOTIENTS-AGES 12-0 TO 12-11 


FULL 8CALB 


■WEIGHTED 

SCORE 

12-0 

I.Q. 

12-3 

l.Q. 

12-6 

I.Q. 

12-9 

I.Q. 

WEIGHTED 

SCOBS 

12-0 

I.Q. 

12-3 

I.Q. 

12-6 

I.Q. 

12-9 

I.Q. 

5 

38 

36 

35 

33 

65 

89 

87 

85 

83 

6 

39 

37 

36 

34 

66 

90 

88 

86 

84 

7 

40 

38 

37 

35 

67 

91 

89 

87 

85 

8 

41 

39 

38 

38 

68 

92 

90 

88 

86 

9 

42 

40 

39 

37 

69 

92 

90 

89 

87 

10 

42 

40 

39 

37 

70 

93 

91 

89 

87 

U 

43 

41 

40 

38 

71 

94 

92 

90 

88 

12 

44 

42 

41 

39 

72 

95 

93 

91 

89 

13 

45 

43 

42 

40 

73 

96 

94 

92 

90 

14 

46 

44 

43 

41 

74 

97 

95 

93 

91 

15 

47 

45 

44 

42 

75 

98 

96 

94 

92 

16 

48 

46 

45' 

43 

76 

98 

96 

95 

93 

17 

48 

46 

45 

43 

77 

99 

97 

95 

93 

18 

49 

47 

46 

44 

78 

100 

98 

96 

94 

10 

50 

48 

47 


79 

101 

99 

97 

95 

20 

51 

49 

48 

46 

80 

102 

100 

98 

96 

21 

62 

50 

49 

47 

81 

103 

101 

99 

97 

22 

53 

51 

50 

48 

82 

104 

102 

100 

98 

23 

63 

51 

50 

48 

83 

104 

102 

lOO 

98 

24 

64 

52 

51 

49 

84 

105 

103 1 

101 

99 

25 

55 

53 

52 

SO 

85 

106 

104 

102 

100 

26 

66 

64 

53 

51 

86 

107 

105 

103 

101 

27 

57 

55 

54 

52 

87 

108 

106 

104 

102 

28 

57 

55 

54 

52 

88 

108 

106 

104 

102 

29 

58 

58 

55 

53 

89 

109 

107 

105 

103 

30 

59 

57 

56 

54 

90 

no 

108 

106 

104 

31 

60 

58 

57 

55 

91 

111 

109 

107 

105 

32 

61 

59 

58 

56 

92 

112 

no 

108 

106 

33 

62 

60 

59 

57 

93 

112 

no 

108 

106 

34 

63 

61 

60 

58 

94 

113 

111 

1 109 

107 

55 

63 

61 

60 

58 

95 

114 

112 

110 

108 

36 

64 

62 

61 

59 

96 

115 

113 

111 

109 

37 

65 

63 

62 

60 

97 

116 

114 

112 

110 

38 

66 

' 64 

63 

61 

98 

117 

115 

113 

111 

39 

67 

65 

64 

62 

99 

118 

116 

114 

112 

40 

68 

66 

65 

63 

100 

118 

116 

114 

112 

41 

68 

66 

65 

63 

101 

119 

117 

115 

113 

42 

69 

67 

66 

64 

102 

120 

118 

116 

114 

43 

70 

68 

67 

65 

103 

121 

119 

117 

115 

44 

71 

69 

68 

66 

104 

122 

120 

118 

116 

45 

72 

70 

69 

67 

105 

123 

121 

119 

117 

46 

73 

71 

69 

67 

106 

123 

121 

119 

117 

47 

73 

71 

70 

68 

107 

124 

122 

120 

118 

48 

74 

72 

71 

69 

108 

125 

123 

121 

119 

49 

75 

73 

72 

70 

109 

126 

124 

122 

120 

60 

76 

74 

73 

71 

110 

127 

125 - 

123 

121 

51 

77 

75 

74 

72 

111 

128 

126 

124 

122 

62 

78 

76 

74 

72 

112 

128 

126 

124 

122 

63 

79 

77 

75 

73 

113 

129 

127 

125 

123 

54 

80 

78 

76 

74 

114 

130 

128 

126 

124 

65 

80 

78 

77 

75 

115 

131 

129 

127 

125 

66 

81 

79 

78 

76 

120 

135 

133 

131 

129 

67 

82 

80 

79 

77 

125 

140 

137 

135 

133 

58 

83 

81 

80 

78 

130 

144 

141 

139 

137 

69 

84 

82 

80 

78 

135 

148 

146 

144 

142 

60 

85 

83 

81 

79 

140 

152 

150 

148 

146 

61 

86 

84 

82 

80 

145 

157 

154 

152 

150 

62 

86 

84 

83 

81 

150 

161 

168 

156 

154 

63 

87 

85 

84 

82 

155 

165 

162 

160 

158 

64 

88 

86 

85 

83 

160 

170 

167 

165 

163 
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MEASUREMENT OF ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES 13—0 TO 13—11 


WEIGHTED 

SCORE 

13—0 

I.Q. 

5 

32 

6 

33 

7 

34 

S 

35 

9 

36 

10 

36 

11 

37 

12 

38 

13 

39 

14 

40 

15 

41 

16 

42 

17 

42 

18 

43 

19 

44 

20 

45 

21 

46 

22 

47 

23 

47 

24 

48 

25 

49 

26 

50 

27 

51 

28 

51 

29 

52 

30 

53 

31 

54 

32 

55 

33 

56 

34 

57 

35 

67 

86 

58 

37 

59 

38 

60 

39 

61 

40 

62 

41 

62 

42 

63 

43 

64 

44 

65 

45 

66 

46 

66 

47 

67 

48 

68 

49 

69 

50 

70 

51 

71 

52 

71 

53 

72 

54 

73 

55 

74 

56 

75 

57 

76 

58 

77 

59 

77 

60 

78 

61 

79 

62 

80 

63 

81 

64 

82 


rULL. SCADE 


13—3 

i.Q. 

13—6 

I.Q. 

13—9 

I.Q. 

WEIGHTED 

SCORE 

13—0 

I.Q. 

13—3 

I.Q. 

13— 

I.Q. 

13-9 

si.Q. 

31 

30 

29 

65 

82 

81 

80 

79 

32 

31 

30 

66 

83 

81 

80 

79 

33 

32 

31 

67 

84 

82 

81 

80 

34 

33 

32 

68 

86 

83 

82 

81 

34 

33 

32 

69 

86 

84 

83 

82 

35 

34 

33 

70 

86 

85 

84 

83 

36 

35 

34 

71 

87 

86 

85 

84 

37 

36 

35 

72 

88 

86 

85 

84 

3S 

37 

36 

73 

89 

87 

86 

85 

39 

38 

37 

74 

90 

88 

87 

86 

39 

38 

37 

75 

91 

89 

88 

87 

40 

39 

38 

76 

92 

90 

89 

88 

41 

40 

39 

77 

92 

90 

89 

88 

42 

41 

40 

78 

93 

91 

90 

89 

43 

42 

41 

79 

94 

92 

91 

90 

44 

43 

42 

80 

95 

93 

92 

91 

44 

43 

42 

81 

96 

94 

93 

92 

45 

44 

43 

82 

97 

95 

94 

93 

46 

45 

44 

83 

97 

95 

94 

93 

47 

46 

45 

84 

98 

96 

95 

94 

48 

47 

46 

85 

99 

97 

96 

95 

49 

48 

47 

86 

100 

98 

97 

96 

49 

48 

47 

87 

101 

99 

98 

97 

50 

49 

48 

88 

101 

99 

98 

97 

51 

50 

49 

89 

102 

100 

99 

98 

52 

51 

50 

90 

103 

101 

100 

99 

53 

52 

51 

91 

104 

102 

101 

100 

53 

52 

51 

92 

105 

103 

102 

101 

54 

53 

52 

93 

105 

104 

103 

102 

55 

54 

53 

94 

106 

104 

103 

102 

56 

55 

54 

95 

107 

105 

104 

103 

57 

56 

55 

96 

108 

106 

105 

104 

58 

57 

56 

97 

109 

107 

106 

105 

58 

57 

56 

98 

110 

108 

107 

106 

59 

58 

57 

99 

no 

108 

107 

106 

60 

59 

58 

100 

111 

109 

108 

107 

61 

60 

59 

101 

112 

no 

109 

108 

62 

61 

60 I 

102 

113 

in 

no 1 

109 

62 

61 

60 

103 

114 

112 

111 

no 

63 

62 

61 

104 

115 

113 

112 

110 

64 

63 

62 

1 105 

116 

113 

112 

111 

65 

64 

63 

B 106 

116 

114 

113 

112 

68 

65 

64 

R 107 

117 

115 

114 

113 

67 

66 

65 

I 108 

118 

116 

115 

114 

67 

66 

65 

109 

119 

117 

116 

115 

68 

67 

66 

110 

119 

117 

116 

115 

69 

68 

67 

in 

120 

118 

117 

116 

70 

69 

68 

112 

121 

119 

118 

117 

71 

70 

69 

113 

122 

120 

119 

118 

71 

70 

69 

114 

123 

121 

120 

119 

72 

71 

70 

115 

124 

122 

121 

120 

73 

72 

71 

120 

128 

126 

125 

124 

74 

73 

72 

125 

132 

130 

129 

128 

75 

74 

73 

130 

136 

134 

133 

132 

76 

75 

74 

135 

140 

138 

137 

136 

76 

75 

74 

140 

144 

142 

141 

140 

77 

76 

75 

146 

148 

146 

145 

144 

78 

77 

76 

150 

152 

150 

149 

148 

79 

78 

77 

155 

166 

154 

153 

152 

80 

79 

78 

160 

161 

159 

168 

157 
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INTELLIGENCE QUOTIENTS— AGES 14—0 TO 14—11 


FULL SCALE 


WEIGHTED 

SCORES 

14-0 

I.Q. 

14—3 

I.Q. 

14—8 

I.Q. 

WEIGHTED 

SCORES 

14—0 

I.Q. 

14—3 

r.Q. 

14—6 

I.Q. 

5 

29 

28 

28 

65 

78 

77 

77 

6 

30 

29 

29 

66 

79 

78 

78 

7 

31 

30 

30 

67 

80 

79 

79 

8 

32 

31 

31 

68 

80 

79 

79 

9 

32 

32 

32 

69 

81 

80 

80 

10 

S3 

32 

32 

70 

82 

81 

81 

11 

34 

33 

33 

71 

S3 

82 

82 

12 


34 

34 

72 

84 

83 

83 

13 

38 

35 

35 

73 

85 

84 

84 

14 

37 

36 

36 

74 

85 

84 

84 

15 

37 

37 

37 

75 

86 

85 

85 

16 

33 

37 

37 

76 

87 

86 

86 

17 

39 

38 

38 

77 

88 

87 

87 

18 

40 

39 

39 

78 

89 

88 

88 

19 

41 

40 

40 

79 

89 

88 

88 

20 

42 

41 

41 

80 

90 

89 

89 

21 

42 

41 

41 

81 

91 

90 

90 

22 

43 

42 

42 1 

83 

92 

91 

91 

23 

44 

43 

43 j 

83 

93 

92 

92 

24 

45 

44 

44 

84 

93 

92 

02 

25 

46 

45 

45 

85 

94 

93 

93 

26 

46 

45 

45 

86 

95 

94 

94 

27 

47 

46 

46 

87 

96 

95 

95 

28 

48 

47 

47 

88 

96 

95 

95 

29 

49 

48 

48 

89 

97 

96 

96 

30 

50 i 

49 i 

49 

90 

98 

97 

' 97 

31 

50 

49 

49 

91 

99 

98 

98 

32 

51 

50 

60 

92 

100 

99 

99 

33 ' 

52 

51 

61 

03 

101 

100 

100 

34 

S3 

53 

52 

94 

101 

100 

100 

35 

54 

53 

63 

95 

' 102 

101 

101 

36 

65 

54 

63 

96 

103 

102 

102 

37 

65 

64 

54 

97 

1 104 

103 

103 

38 

56 

55 

65 

98 

105 

104 

104 

39 

57 

56 

56 

99 

! 105 

104 

104 

40 

58 

67 

57 

100 

106 

105 

105 

41 

59 

58 

58 

101 

107 

106 

106 

42 

59 

58 

68 

102 

108 

107 

107 

43 

60 

59 

59 

103 

109 

108 

108 

44 

61 

60 

60 

104 

109 

108 

108 

45 

62 

61 

01 

105 

110 

log 

109 

46 

63 

62 

62 

106 

111 

no 

no 

57 

63 

62 

62 

107 

112 

111 

111 

48 

64 

63 

63 

108 

113 

112 

112 

49 

65 

64 

64 

109 

114 

113 

111 

50 

66 

65 

65 

110 

114 

113 

113 

51 

67 

66 

66 

! Ill 

115 

114 

114 

52 

67 

66 

65 

112 

116 

115 

115 

53 

68 

67 

67 

113 

117 

116 

116 

54 

69 

68 

68 

114 

118 

117 

117 

55 

70 

69 

69 

115 

119 

118 

118 

66 

71 

70 

70 

120 

123 

122 

122 

57 

71 

70 

70 

125 

127 

125 

126 

58 

72 

71 

71 

130 

131 

130 

130 

59 

73 

72 

72 

135 

135 

134 

134 

60 

74 

73 

73 

140 

139 

138 

138 

61 

75 

74 

74 

145 

143 

142 

142 

62 

75 

74 

74 

160 

147 

146 

146 

63 

76 

75 

75 

155 

151 

160 

150 

64 

77 

76 

76 

160 

156 

165 

154 
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INTELLIGENCE QUOTIENTS— AGES 15 TO 19 


TfEIGHTED 

SCOBS 

15 

I.Q. 

5 

29 

6 

30 

7 

31 

S 

31 

9 

32 

10 

33 

11 

34 

12 

35 

13 

35 

14 

36 

15 

37 

16 

38 

17 

39 

18 

89 

19 

40 

20 

41 

21 

42 

22 

43 

23 

43 

24 

44 

25 

45 

26 

46 

27 

46 

28 

47 

29 

48 

30 

49 

31 

50 

32 

50 

33 

51 

34 ! 

52 

35 

53 

36 

53 

37 

54 

38 

55 

39 

56 

40 

57 

41 

57 

42 

58 

43 

59 

44 

60 

45 

61 

46 

61 

47 

62 

48 

63 

49 

64 

50 

65 

61 

65 

52 

66 

53 

67 

54 

68 

55 

68 

56 

69 

57 

70 

58 

71 

59 

72 

60 

72 

61 

73 

62 

74 

63 

75 

64 

. 76 


FUTLL SCALE 


16 

I.Q. 

17 19 

I.Q. 

28 

29 

29 

30 

30 

31 

31 

32 

32 

32 

32 

33 

33 

34 

34 

35 

35 

35 

35 

36 

36 

37 

37 

38 

38 

39 

39 

39 

39 

40 

40 

41 

41 

42 

42 

42 

43 

43 

43 

44 

44 

45 

45 

45 

46 

46 

46 

47 

47 

48 

48 

48 

49 

49 

50 

50 

50 

51 

51 

51 

52 

52 

53 

53 

53 

54 

54 

55 

55 

55 

56 

56 

57 

57 

57 

57 

58 

58 

59 

59 

60 

60 

61 

61 

61 

61 

62 

62 

63 

63 

64 

64 

64 

64 

65 

65 

66 

68 

67 

67 

68 

67 

68 

68 

69 

69 

70 

70 

71 

70 

72 

71 

72 

72 

73 

73 

74 

73 

75 

74 


WEIGHTED 

SCORE 

15 

I.Q. 

65 

76 

66 

77 

67 

78 

68 

79 

69 

80 

70 

80 

71 

81 

72 

82 

73 

83 

74 

83 

76 

84 

76 

85 

77 

86 

78 

87 

79 

87 

80 

88 

81 

89 

82 

90 

83 

90 

84 

91 

85 

92 

86 

93 

87 

94 

88 

94 

89 

95 

90 

96 

91 

97 

92 

98 

93 

98 

94 

99 

95 

100 

96 

101 

97 i 

102 

^ 98 

102 

99 

103 

100 

104 

101 

105 

102 

105 

103 

106 

104 

107 

105 

108 

106 

109 

107 

109 

103 

no 

109 

111 

110 

112 

111 

113 

112 

113 

113 

114 

114 

115 

115 

116 

B 120 

120 

125 

124 

130 

128 

135 

132 

140 

135 

145 

139 

150 

143 

155 

147 

160 

151 


16 

I.Q. 

17-19 

I.Q. 

76 

75 

76 

76 

77 

77 

78 

77 

79 

78 

79 

79 

80 

80 

81 

80 

82 

81 

83 

82 

83 

83 

84 

83 

85 

84 

86 

85 

86 

86 . 

87 

86 

88 

87 

89 

88 

90 

89 

90 

90 

91 

90 

92 

91 

93 

92 

94 

93 

94 

93 

95 

94 

96 

95 

97 

96 

97 

96 

98 

97 

99 

98 

lOO 

99 

101 

99 

101 1 

100 

102 

101 

103 

102 

104 

102 

105 

103 

105 

104 

106 

105 

107 

106 

108 

106 

109 

107 

109 

108 

no 

109 

111 

109 

112 

no 

112 

111 

113 

112 

114 

112 

115 

113 

119 

117 

123 

121 

127 

125 

130 

128 

135 

132 

138 

136 

142 

140 

146 

144 

150 

147 


2SS2S gS55gS! SSSSS fegijgg; fe&fe 


INTELLIGENCE QUOTIENTS TABLES 
INTELLIGENCE QUOTIENTS-AGES 20 TO 39 


■WEIGHTED JO-24 25-29 I 30-34 35-39 | weighted , 20-24 25-29 30-34 | 35-39 

8COBE I.Q. I.Q. I.Q. I.Q. 3 SCOBE I.Q. I.Q, I.Q. I.Q, 
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INTELLIGENCE QUOTIENTS— AGES 10-0 TO 10—11 


TXRBAli SCAIiE 


WEIGHTED 

fiCOBES 

10-0 

LQ. 

10-3 

LQ. 

10—8 

LQ. 

10-0 

LQ. 

WEIGHTED 

8COBX8 

10—0 

LQ. 

10—3 

LQ. 

10-6 

LQ. 

10—9 

LQ. 

5 

65 

62 

60 

58 

45 

131 

127 

124 

121 

6 

67 

64 

62 


46 

132 

128 

125 

122 

7 

68 

65 

63 

61 

47 

134 

130 

127 

124 

8 

70 

67 

65 

63 

48 

136 

132 

129 

126 

9 

72 

69 

67 

65 

49 

138 

134 

130 

127 

10 

74 

71 

68 

66 

50 

139 

135 

132 

129 

11 

75 

72 

70 

68 

51 

141 

137 

133 

130 

12 

77 

74 

71 

69 

52 

143 

139 

135 

132 

13 

79 

76 

73 

71 


144 

140 

137 

134 

14 

80 

77 

75 

73 

-w 

146 

142 

138 

i35r-“ 

15 

82 

79 

76 

74 

55 

148 

144 

140 

137 

16 

84 

81 

78 

76 

56 

149 

145 

141 

138 

17 

85 

82 

79 

77 

57 

151 

147 

143 

140 

18 

87 

84 

81 

79 

58 

153 

149 

145 

141 

19 

89 

86 

83 


59 

154 

150 

146 

143 

20 

90 

87 

84 

82 

60 

156 

152 

148 

144 

21 

92 

89 

86 

83 

61 

157 

153 

149 

146 

22 

93 

90 

87 

85 

62 

159 

155 

151 

147 

23 

95 

92 

89 . 

86 

63 

160 

156 

152 

149 

24 

96 

93 

90 

88 

64 

162 

158 

154 

150 

25 

98 

95 

92 

89 

65 

164 

160 

156 

152 

26 


97 

94 

91 

66 

165 

1 161 

157 

153 

27 

101 

98 

95 

92 

67 

167 

! 163 

159 

155 

28 

103 

100 

97 

94 

68 

169 

164 

160 

156 

29 

105 

101 

98 

95 

69 

170 

166 

162 

158 

30 

106 

103 

100 

97 

70 

172 

1 168 

164 

160 

31 

108 

105 

102 

99 

71 

174 

169 

165 

161 

32 

no 

106 

103 

100 . 

72 

175 

171 

167 

163 

33 

111 

108 

1 105 

102 

73 

177 

172 

168 

164 

34 

113 

109 

106 

[ 

103 

74 

179 

1 

174 

170 

166 

35 

115 

111 

1 108 

105 

75 

! 180 

176 

172 

168 

36 

117 

113 

no 


76 

1 182 

177 

^-173 

169 

37 

118 

114 

111 

108 

77 

184 

179 

175 

171 

38 

120 

116 

113 

■BH 

78 

185 

180 

176 

172 

39 

121 

117 

114 

111 

79 

187 

182 

178 

174 

40 

123 

119 

116 

113 


188 

183 

179 

175 

41 

124 

120 

117 

114 

81 

190 

185 

181 

177 

42 

126 

122 

119 

116 

82 

192 

187 

183 

1 178 

43 

128 

124 

121 

118 

83 

193 

1 188 

184 

t 180 

44 

129 

1 125 

! 122 

119 

84 

195 

1 190 

186 

1 182 
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INTELLIGENCE QUOTIENTS— AGES 11-0 TO 11—11 

VEEBAL SCALE 


\ncxaHTXD 

8COBX8 

11-0 

I.Q. 

11-0 

I.Q. 

11—6 

LQ. 

11-9 

LQ. 

WZIOSTXD 

8C0BS3 

ll-H) 

I.Q. 

11-3 

LQ* 

11—6 

LQ. 

11-9 

LQ. 

5 

56 

54 

53 

51 

45 

118 

115 


Ill 

6 

58 

56 

55 

53 

46 

119 

116 

B ’ S 

112 

7 

59 

57 

56 

54 

47 

121 

118 


114 

8 

61 

59 

58 

56 

48 

123 

120 

Bn'S 

115 

9 

63 

61 

59 

57 

49 

124 

121 

119 

117 

10 

64 

62 

61 

59 

50 

126 

123 

120 

118 

11 

66 

64 

62 

60 

51 

127 

124 

122 

120 

12 

67 

65 

64 

62 

52 

129 

126 

123 

121 

13 

69 

67 

65 

63 

53 

131 

128 

125 

123 

14 

71 

69 

67 

65 

54 

132 

129 

126 

124 

15 

72 

70 

68 

66 

55 

134 

131 

12$ 

126 

16 

74 

72 

70 

68 

56 

135 

132 

129 

127 

17 

75 

73 

71 

69 

57 

137 

134 

131 

129 

18 

77 

75 

73 

71 

58 

138 

135 

132 

130 

19 

78 

76 

74 

72 

59 

140 

137 

134 

132 

20 

80 

78 

76 

74 

60 

141 

138 

135 

133 

21 

81 

79 

77 

75 

61 

143 

140 

137 

135 

22 

83 

81 

79 

77 

62 

144 

141 

138 

136 

23 

84 

82 

80 

78 

63 

146 

143 

140 

138 

24 

86 

84 

82 


64 

147 

144 

141 

139 

25 

87 

85 

83 

81 

65 

149 

146 

143 

141 

26 

89 

87 

85 

83 

66 

150 

147 

144 

142 

27 


88 

86 

84 

67 

152 

149 

146 

144 

28 

92 

89 

87 

85 

68 

153 

150 

147 

145 

29 

93 

91 

89 

87 

69 

155 

152 

149 

147 

30 

95 

92 

90 

88 

70 

157 

153 

150 

148 

31 

97 

94 

92 


71 

158 

154 

151 

149 

.32 

98 

96 

94 

92 

72 

160 

156 

153 

151 

33 


97 

95 

93 

73 

161 

158 

155 

153 

34 


98 

96 

94 

74 

163 

159 

156 

154 

35 

103 

100 

98 

96 

75 

165 

161 

158 

156 

36 

■Els 

1 101 

99 

97 

76 

166 

162 

159 

157 

37 

106 

103 

■EH 

99 

77 

167 

163 

160 

158 

38 

107 

104 

102 

■Eh^bi 

78 

169 

165 

162 

160 

39 

109 

106 

104 

102 

79 

170 

166 

163 

161 

40 

110 

107 

105 

103 

80 

172 

168 

165 

163 

41 

112 

109 

107 

105 

81 

173 

169 

166 

164 

42 

113 

■luS 


106 

82 

174 

171 

168 

166 

43 

115 

112 

■nMi 

108 

83 

176 

172 

169 

167 

44 

116 

113 

111 1 

mm 

84 

178 

174 

171 

169 
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INTELLIGENCE QUOTIENTS— AGES 12— 0 TO 12—11 


VEEBAl, SCALE 


"WEIGHTED 

SCORES 

12—0 

I.Q. 

12—3 

I.Q. 

12—6 

I.Q. 

12—9 

I.Q. 

WEIGHTED 

SCORES 

12—0 

I.Q. 

12—3 

I.Q. 

12—6 

LQ. 

12-0 

I.Q. 

5 

50 

48 

47 

46 

45 

109 

107 

106 

105 

6 

51 

49 

48 

47 

46 

110 

108 

107 

106 

7 

53 

51 


49 

47 

112 

110 

109 

108 

8 

54 

52 

51 

60 

48 

113 

111 

110 

109 

9 

56 

54 

53 

52 

49 

115 

113 

112 

111 

10 

57 

55 

54 

53 

50 

116 

114 

113 

112 

11 

59 

57 

56 

55 

51 

118 

116 

115 

114 

12 

61 

59 

57 

56 

52 

119 

117 

116 

115 

13 

62 

60 

59 

58 

53 

121 

119 

118 

117 

14 

63 

61 


59 

54 

122 

120 

119 

118 

15 

65 

63 


61 

55 

124 

122 

121 

120 

16 

66 

64 


62 

56 

125 

123 

122 

121 

17 

68 

66 . 

65 

64 

57 

127 

125 

124 

122 

18 

69 

67 

66 

65 

58 

128 

126 

125 

124 

19 

71 

69 

68 

67 

59 

130 

128 

127 

125 

20 

72 

70 

69 

68 

60 

131 

129 

128 

127 

21 

74 

72 

71 

70 

61 

133 

131 

130 

128 

22 

75 

73 

72 

71 

62 

134 

132 

131 

130 

23 

77 

75 

74 

73 

63 

136 

134 

133 

131 

24 

78 

76 

75 

74 

64 

137 

135 

134 

133 

25 

80 

78 

77 

76 

65 

139 

137 

136 

134 

26 

81 

79 

78 

77 

66 

140 

138 

137 

136 

27 

82 

80* 

79 

78 

67 

142 

140 

139 

137 

28 

84 

82 

81 

80 

68 

. 143 

141 

140 

138 

29 

85 

83 

82 

81 

69 

145 

143 

142 

140 

30 

87 

85 

84 

83 

70 

146 

144 

143 

141 

31 

88 

86 

85 

84 

71 

148 

146 

145 

143 

32 

90 

88 

87 

86 

72 

149 

147 

146 

144 

33 

91 

89 

88 

87 

73 

151 

149 

148 

146 

34 

93 

91 

90 

89 

74 

152 

150 

149 

147 

35 

94 

92 

91 

90 

75 

154 

152 

150 

149 

36 

96 

94 

93 

92 

76 

155 

153 

152 

150 

37 

97 

95 

94 

93 

77 

156 

154 

153 

151 

38 

99 

97 

96 

95 

78 

158 

156 

155 

153 

39 

100 

98 

97 

96 

79 

159 

157 

156 

154 

40 

102 

100 

99 

98 

80 

161 

159 

158 

156 

41 

103 

101 

mm 

99 

81 

162 

160 

159 

157 

42 

105 

103 


101 

82 

164 

162 

161 

159 

43 

106 

104 


102 

83 

165 

163 

162 

160 

44 

108 

106 


104 

84 

167 

165 

164 

162 
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INTELLIGENCE QUOTIENTS— AGES 13—0 TO 13—11 


VEEBAIi SCALE 


"WTIGHTED 

SCORES 

13—0 

I.Q. 

13—3 

I.Q. 

13—0 

LQ. 

13—9 

LQ. 

■WEIGHTED 

SCORES 

13—0 

I.Q. 

13—3 

LQ. 

13—6 

LQ. 

13—9 

LQ. 

5 

45 

44 

44 

43 

45 

104 

103 

102 

101 

0 

47 

46 

46 

45 

46 

105 

104 

104 

103 

7 

48 

47 

47 

46 

47 

107 

106 

105 

104 

8 

50 

49 

49 

48 

48 

108 

107 

106 

105 

9 

51 

50 

50 

49 

49 

110 

109 

108 

107 

10 

63 

52 

52 

51 

50 

111 

110 

109 

108 

11 

54 

53 

53 

52 

51 

113 

112 

111 

110 

12 

55 

54 

54 

53 

52 

114 

113 

112 

111 

13 

67 

56 

56 

55 

53 

116 

115 

114 

113 

14 

68 

57 

57 

56 

64 

117 

116 

115 

114 

16 

60 

59 

59 

58 

65 

119 

118 

117 

116 

16 

61 

60 

60 

59 

66 

120 

119 

118 

117 

17 

63 

62 

62 

61 

57 

121 

120 

119 

118 

18 

64 

63 

63 

62 

58 

123 

122 

121 

120 

19 

66 

65 

65 

64 

59 

124 

123 

122 

121 

20 

67 

66 

66 

65 

60 

126 

125 

124 

123 

21 

69 

68 

67 

66 

61 

127 

126 

* 125 

124 

22 

70 

69 

6sr 

68 

62 

129 

128 

127 

126 

23 

72 

71 

70 

69 

63 

130' 

129 

128 

127 

24 1 

73 

72 

72 

71 

64 

132 

131 

130 

129 

26 

75 

74 

73 

72 

65 

133 

132 

131 

130 

26 

76 

75 1 

75 

74 

66 

135 

134 

133 

132 

27 

77 

76 

76 

75 

67 

136 

135 

134 

133 

28 

79 

78 

78 

77 

68 

137 

136 

135 

134 

29 

80 

79 

79 

78 

69 

139 

138 

137 

136 

30 

82 

81 

81 

80 

70 

140 

139 

138 

137 

31 

83 

82 

82 

81 

71 

142 

141 

140 

139 

32 

85 

84 

83 

82 

72 

143 

142 

141 

140 

33 

86 

85 

85 

84 

73 

145 

144 

143 

142 

34 

88 

87 

86 

85 

74 

146 

145 

144 

143 

35 

89 

88 

88 

87 

75 

148 

147 

146 

145 

36 

91 

90 

89 

88 

76 

149 

148 

147 

146 

37 

92 

91 

91 

90 

77 

150 

149 

148 

147 

38 

94 

93 

92 

91 

78 

152 

151 

150 

149 

39 

95 

94 

94 

93 

79 

153 

152 

151 

150 

40 

97 

96 

95 

94 

80 

155 

154 

153 

152 

41 

98 

97 

96 

95 

81 

156 

155 

154 

153 

42 

100 

99 

98 

97 

[ 82 

158 

157 

156 

155 

43 

101 

100 1 

99 

98 

1 83 

159 

158 

157 

156 

44 

103 

102 1 

101 

100 

84 

161 

160 

159 

158 
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INTELLIGENCE QUOTIENTS— AGES 14-0 TO 14—11 


VERBAL SCALE 


'WfiXCtHTSD 

fiCOHSS 

14-0 

LQ. 

14-3 

LQ. 

14—6 

LQ. 

WBIGHTBD 

8CO&SS 

14-0 

I.Q. 

14-S 

I.Q. 

14-0 

I.Q. 

5 

43 


43 

45 

101 

100 

100 

6 

45 


44 

46 

103 

102 

102 

7 

46 


46 

47 

104 

103 

103 

8 

48 


47 

48 

105 

104 

104 

9 

49 


48 

49 

107 

106 

106 

10 

51 

50 

50 

50 

108 

107 

107 

11 

52 

51 

51 

51 

110 

109 

109 

12 

53 

53 

53 

52 

111 

110 

110 

13 

55 

54 

54 

53 

113 

112 

112 

14 

56 

56 

56 

54 

114 

113 

113 

15 

58 

57 

57 

55 

115 

114 

114 

16 

59 

58 

58 

56 

117 

116 

116 

17 

61 

60 

60 

57 

118 

117 

117 

18 

62 

61 

61 

58 

120 

119 

119 

19 

64 

63 

63 

59 

121 

120 

120 

20 

65 

64 

64 

60 

123 

122 

122 

21 

66 

66 

66 

61 

124 

123 

123 

22 

68 

67 

67 

62 

125 

124 

124 

23 

69 

68 

68 

63 

127 

126 

126 

24 

71 

70 

70 

64 

128 

127 

127 

25 

72 

71 

71 

65 

130 

129 

129 

20 

74 

73 

73 

66 

131 

130 

130 

27 

75 

74 

74 

67 

133 

132 

132 

28 

77 

76 

76 

68 

134 

133 

133 

29 

78 

77 

77 

69 

136 

135 

135 

30 

80 

79 

79 

70 

137 

136 

136 

31 

81 

80 

80 

71 

138 

137 

137 

32 

82 

81 

81 

72 

140 

139 

1 139 

33 

84 

83 

83 

73 

141 

140 

1 140 

34 

85 

84 

84 

74 

143 

142 

142 

35 

87 

86 

86 

75 

144 

143 

143 

36 

88 

87 

87 

76 

146 

145 

145 

37 

90 

1 89 

89 

77 

147 

146 

146 

38 

91 

90 

90 

78 

148 

147 

147 

39 

92 

91 

91 

79 

150 

149 

149 

40 

94 

93 

93 

80 

151 

150 

150 

41 

95 

94 

94 

81 

153 

152 

152 

42 

97 

96 

96 

82 

154 

153 

153 

43 

98 

97 

97 

83 

156 

155 

155 

44 

100 

99 

99 

84 

157 

156 

156 
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MEASUREMENT OE ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES 15 TO 19 


TESBAIi SCABS 


’W’EIGSTED 

SCOBSS 

15 

LQ. 

16 

I.Q. 


'WlEtQHTBD 

SCOBXS 


16 

I.Q. 

17-19 

I.Q. 

5 

45 

46 

47 

45 

99 

98 

98 

6 

46 

47 

48 

46 

101 

100 

^ 99 

7 

48 

48 

49 

47 

102 

101 

100 

8 

49 

49 

50 

48 

104 

102 

101 

9 

51 

51 

52 

49 

105 

104 

103 

10 

52 

52 

53 

50 

106 


104 

11 

53 

53 

54 

51 



105 

12 

55 

55 

55 

52 

109 

108 

107 

13 

56 

56 

57 

53 


109 

108 

14 

57 

57 

58 

54 

112 


109 

13 

59 

59 

59 

55 

113 

112 


16 

60 

60 

61 

56 

114 

113 


17 

61 

61 

62 

57 

116 

114 

■ 

18 

63 

63 

63 

58 

117 

116 

114 

19 

64 

64 

64 

59 

118 

117 

116 

20 

65 

65 

66 

60 

120 

118 

117 

21 

67 

67 

67 

61 

121 

119 

118 

22 

68 

68 

68 

62 

122 

121 

119 

23 

69 

69 

70 

63 

124 

122 

121 

24 

71 

71 

71 

64 

125 

123 

122 

23 

72 i 

72 

72 

65 

126 

125 


26 

74 1 

73 

73 

66 

128 

126 


27 

75 

75 

75 

67 

129 

127 


28 1 

76 

76 

76 

ea 

i3r 

129 


29 

78 


77 

69 

132 

130 


30 

79 


79 

70 

133 

131 

130 

31 

80 


80 

71 

135 

133 

131 

32 

82 

81 

81 

72 

136 

134 

132 

33 

83 

83 

82 

73 

137 

135 

134 

34 

84 

84 

84 

74 

139 

137 

135 

35 

86 

85 

85 

75 

140 

138 

mM 

36 


86 

86 

76 

141 

139 


37 

89 

88 

88 

77 

143 

141 

■Ih 

38 

90 

89 

89 

78 

144 

142 

ksB 

39 

91 

90 

90 

79 

145 

143 

msM 

40 

93 

92 

91 

80 

1 147 

144 

143 

41 

94 

93 

93 

81 

148 

146 

144 

42 

95 

94 

94 

82 

149 

147 

145 

43 

97 

96 

95 

83 

151 

148 

146 

44 

98 

97 

96 1 

84 

152 

150 

148 
















INTELLIGENCE QUOTIENTS TABLES 
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INTELLIGENCE QUOTIENTS— AGES 20 TO 39 


VERBAL SCALE 


WEIGHTED 

SCORES 

20-24 

I.Q. 

25-29 

I.Q. 

30-34 

LQ. 

35-39 

I.Q. 

WEIGHTED 

SCORES 

20-24 

I.Q. 

25-29 

I.Q. 

30-34 

I.Q. 

35-39 

I.Q. 

5 

47 

50 

51 

54 

45 

97 

98 

98 

99 

6 

49 

51 

52 

55 

46 

99 

99 

99 

101 

7 

50 

53 

53 

56 

47 

100 

100 

101 

102 

8 

51 

54 

54 

57 

48 

101 

101 

102 

103 

9 

52 

55 

56 

58 

49 

103 

102 

103 

104 

10 

54 

56 

57 

60 

50 

104 

104 

104 

105 

11 

55 

57 

58 

61 

51 

105 

105 

105 

106 

12 

56 

59 

59 

62 

52 

106 

106 

107 

107 

13 

57 

60 

60 

63 

53 

107 

107-^ 

108 

109 

14 

59 

61 

62 

64 

54 

109 

108 

109 

no 

15 

60 

62 

63 

65 

55 

110 

no 

no 

111 

16 

61 

63 

64 

66 

56 

111 

111 

111 

112 

17 

62 

64 

65 

68 

57 

112 

112 

112 

113 

18 

64 

66 

66 

69 

58 

113 

113 

114 

114 

19 

65 

67 

67 

70 

59 

114 

114 

115 

115 

20 

66 

68 

69 

71 

60 

115 

115 

116 

116 

21 

67 

69 

70 

72 

61 

117 

117 

117 

118 

22 

69 

70 

71 

73 

62 

118 

118 

118 

119 

23 

70 

71 

72 

74 

63 

119 

119 

120 

120 

24 

71 

73 

73 

76 

64 

120 

120 

121 

121 

25 ' 

72 

74 

75 

77 

65 

121 

! 121 

122 

122 

26 

74 

75 

76 

78 

66 

122 

123 

123 

123 

27 

75 

76 

77 

79 

67 

124 

124 

124 

124 

28 

76 

78 

78 

80 

68 

125 

125 

126 

126 

29 

•77 

79 

79 

81 

er”' 

■ 126 

126 

127 

127 

30 

79 

80 

80 

82 

70 

127 

127 

128 

128 

31 

80 

81 

82 

83 

71 

129 

129 

129 

129 

32 

81 

82 

83 

85 

72 

130 

130 

130 

130 

33 

82 

83 

84 

86 

73 

131 

131 

131 

131 

34 

84 

85 

85 

87 

74 

132 

132 

133 

133 

35 

85 

86 

86 

88 

75 

133 

133 

134 

134 

36 

86 

87 

88 

89 

76 

134 

134 

135 

135 

37 

87 

88 

89 

90 

77 

135 

136 

136 

136 

38 

89 

89 

90 

91 

78 

137 

137 

137 

137 

39 

90 

90 

91 

93 

79 

140 

138 


138 

40 “ 

91 

92 

92 

94 

80 

141 

139 

140 

139 

41 

93 

93 

93 

95 

81 

143 

140 

141 

140 

42 

94 

94 

95 

96 

82 

144 

142 

142 

142 

43 

95 

95 

96 

97 

83 

145 

143 

143 

143 

44 

96 

96 

97 

98 

84 

146 

144 

145 

144 
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MEASUREMENT OF ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES 40 TO 50 


VEBBAI* SCALE 


WEIGHTED 

BCOEZS 

BSSI 

WM 

45-49 

LQ. 

60-54 

LQ. 

55-59 

I.Q. 

’VFXaGHTZI 

BCOBSa 

>1 40-44 
LQ. 

45-49 

LQ. 

50-64 

LQ. 

55-69 

I.Q. 

5 

56 

58 

61 

63 

45 

101 


103 

104 

6 

57 

59 

62 

64 

46 

102 


104 

105 

7 

58 

61 

63 

65 

47 

103 


105 

106 

8 

59 

62 

64 

66 

48 

104 

105 

106 

107 

9 

60 

63 

65 

67 

49 

105 

106 

107 

108 

10 

62 

64 

[66 

68 1 

50 

106 

107 

108 

109 

11 

63 

65 

67 

69 I 

51 

107 

108 

109 

110 

12 

64 

66 

68 


52 

108 

109 

110 

111 

13 

65 

67 

69 

71 

53 

109 

110 

111 

112 

14 

66 

68 

\mrsm 

72 

54; 

111 

111 

112 

113 

15 

67 

69 

7l 

73 

55 

112 

113 

114 

U4 

16 

68 

70 

72 

74 

56 

113 

114 

115 

115 

17 

69 

71 

73 

75 

57 

114 

115 

116 

116 

18 

70 

72 

74 

76 

58 

115 

116 

117 

117 

19 

72 

74 

75 

77 

59 

116 

117 

118 

118 

20 

73 ' 

75 

76 

79 

60 

117 

118 

119 

119 

21 

74 

76 

78 


61 

118 

119 

120 

120 

22 

Vi> i 

77 

79 

81 

62 

120 

120 

121 

121 

23 

76 

78 

80 

82 

63 

121 

121 

122 

122 

24 

77 

79 

84. 

83 

64 

122 

122 

m 

123 

25 

78 

80 

82 

84 

65 

123 

m 

124 

124 

26 

79 

81 

83 

85 

66 

124 

124 

125 

125 

27 

80 

82 

84 

86 

67 

125 

125 

126 

126 

28 

82 

83 

85 

87. 

68 

1 126 

127 

127 

127 

29 

83 

84 

86 

88 

69 

127 

128 

128 

129 

30 

84 

85 

87 

$9 

70 

128 

129 

130 

130 

31 

85 

86 

88 

90 ‘ 

71 

129 

130 

131 

131 

32 

86 

87 

89 

91 

72 

130 

131 

132 

132 

33 

87 

89 

90 

92 

73 

132 

132 

133 

133 

34 

88 

90 

91 

93 

74 

133 

133 

134 

134 

35 

89 

91 

92 

94 

75 

.134 

134 

135 

135 

36 

91 

92 

93 

95 

76 

135 

135 

136 

136 

37 

92 

93 

95 

96 

77 

136 

136 

137 

137 

38 

93 

94 

96 

97 

78 

137 

137 

138 

138 

39 

94 

95 

97 

98 

79 

138 

138 

139 

139 

40 

95 

96 

98 

99 

80 

140 

139 ‘ 

140 

140 

41 

96 

97 

99 

100 

81 

141 

141 

141 

141 

42 

97 

98 

100 


82 

142 

142 

142 

142 

43 

98 

99 

101 

102 

83 

143 

143 

143 

143 

44 

99 

101 

102 

Bail 

84 

144 

144 

144 

144 












INTELLIGENCE QU0T1£!HTS TABLES 
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INTELLIGENCE QUOTIENTS— AGES 10-0 TO 10—11 

PERFORMANCE SCALE 


-W-EiaHTED 
SCO BBS 

10—0 

I.Q. 

10—3 

I.Q. 

10—8 

I.Q. 

10—9 

I.Q. 

WBIQHTED 

eco»£S 

10-0 

I.Q. 

10—3 

I.Q. 

10-« 

LQ. 

10-9 

LQ. 

5 

63 

61 

59 

57 

45 

125 

123 

121 

118 

6 

65 

63 

61 

59 

46 

127 

124 

122 

119 

7 

66 

64 

62 

60 

47 

129 

126 

124 

121 

8 

68 

66 

64 

62 

48 

130 

127 

125 

123 

9 

69 

67 

65 

63 

49 

132 

129 

127 

124 

10 

71 

69 

67 

65 

50 

133 

130 

128 

125 

11 

72 

70 

68 

66 

51 

135 

132 

130 

127 

12 

74 

72 

70' 

68 

52 

136 

133 

131 

128 

13 

75 

73 

71 

69 

53 

138 

135 

133 

130 

14 

77 

75 

73 

71 

54 

140 

137 

135 

132 

15 

79 

77 

75 

73 

55 

141 

138 

136 

133 

16 

80 

78 

76 

74 

56 

143 

140 

138 

135 

17 

81 

79 

77 

75 

57 

144 

141 

139 

136 

18 

83 

81 

79 

77 

58 

146 

143 

141 

138 

19 

85 

83 

81 

79 

59 

148 

145 

143- 

140 

20 

86 

84 

82 

80 

60 

149 

146 

144 

141 

21 

88 

86 

84 

82 

61 

160 

147 

145 

142 

22 

89 

87 

85 

83 

62 

152 

149 

147 

144 

23 

91 

89 

87 

85 

63 

153 

160 

148 

145 

24 

92 

90 j 

88 

86 

64 

155 

152 

150 1 

147 

25 

94 

92 

90 

88 

65 

156 

153 

151 

148 

26 

95 

93 

91 

89 

66 

158 

155 

163 

150 

27 

97 

95 1 

93 - 

91 

67 

160 

157 

155 

152 

28 

99 

97 

95 

92 

68 

161 

158 

156 

153 

29 

100 

98 1 

96 

94 

69 

163 

160 

168 

155 

30 

102 

100 

98 

95 

70 

164 

161 

159 

156 

31 

103 

101 i 

99 

97 

71 

165 

163 

161 

158 

82 

105 

103 

101 

98 

72 

167 

164 

162 

159 

33 

107 

104 

102 

100 

73 

169 

166 

164 

161 

34 

108 

106 

104 

101 

74 

171 

168 

165 

162 

35 

110 

107 

105 

103 

75 

172 

169 

167 

164 

36 

111 

109 

107 

104 

76 

174 

171 

168 

165 

37 

113 

no 

108 

106 

77 

175 

172 

170 

167 

38 

114 

112 

no 

107 

78 

177 

174 

171 

168 

39 

116 

113 

111 

109 

79 

179 

176 

173 

170 

40 

118 

115 

113 

no 

80 

180 

177 

175 

172 

41 

119 

117 

115 

112 

81 

182 

179 

176 

173 

42 

121 

118 

116 

113 

82 

183 

180 

178 

175 

43 

122 

120 

118 

115 

$3 

185 

181 

179 

176 

44 

124 

121 

119 

116 

84 

186 

183 

181 

178 



248 MEASUHEMENT OF ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES H-OTO 11— H 

PERFOBUJLNCII SCALE 


WSIGHTSD 

0COBB9 

11-0 

I.Q. 

11-0 

LQ. 

11-6 

I.Q. 

11-0 

I.Q. 

WSXGHTSO 

fiCOBBS 

11-0 

I.Q. 

U-3 

LQ. 

11-6 

I.Q. 

11-0 

LQ, 

5 

65 

63 

52 

60 

46 

116 

114 

112 

110 

6 

67 

66 

64 

62 

46 

117 

115 

113 

111 

7 

68 

66 

66 

63 

47 

119 

117 

115 

113 

8 

60 

68 

67 

55 

48 

120 

118 

116 

114 

9 

61 

69 

68 

66 

49 

122 

120 

118 

116 

10 

63 

61 

60 

58 

60 

123 

121 

119 

117 

11 

64 

62 

61 

69 

61 

125 

123 

121 

119 

12 

66 

64 

62 

60 

62 

126 

124 

122 

120 

13 

67 

65 

64 

62 

63 

128 

126 

124 

122 

14 

69 

67 

66 

64 

64 

130 

127 

125 

123 

16 

71 

69 

67 

65 

65 

131 

129 

127 

125 

16 

72 

70 

69 

67 

66 

.133 

130 

128 

126 

17 

73 

71 

70 

68 

67 

134 

132 

130 

128 

18 

75 

73 

72 

70 

68 

136 

133 

131 

129 

19 

77 

75 

73 

71 

69 

138 

135 

133 

131 

20 

78 

76 

74. 

72 

60 

139 

136 

134 

132 

21 

80 

78 

76 

74 

61 

140 

138 

136 

134 

22 

81 

79 

77 

75 

62 

142 

139 

137 

135 

23 

83 

81 

79 

77 

63 

143 

141 

139 

137 

24 1 

84 

82 

81 

79 

64 

145 

142 

140 

138 

26 

86 

84 

82 

80 

65 

146 

144 

142 

140 

26 

87 

85 

83 

81 

66 

148 

146 

143 

141 

27 

89 

87 

85 

83 

67 

160 

147 

145 

143 

28 

90 

88 

86 

84 

68 

161 

148 

146 

144 

29 

92 

90 

88 

86 

69 

163 

150 

148 

146 

39 

93 

91 

89 

87 

70 

154 

151 

149 

147 

31 

96 

93 

91 

89 

71 

165 

162 

150 

148 

32 

96 

94 

92 

90 

72 

167 

164 

162 

150 

33 

98 

96 

94 

92 

73 

159 

156 

154 

152 

34 

99 

97 

95 

93 

74 

160 

157 

155 

153 

36 

101 

99 

97 

96 

76 

161 

158 

166 

154 

36 

102 

100 

98 

96 

76 

163 

160 

158 

166 

37 

104 

102 

100 

98 

77 

164 

161 

169 

157 

38 

105 

103 

101 

99 

1 78 

166 

163 

161 

169 

39 

107 

105 

103 

101 

I 79 

167 

164 

162 

160 

40 

108 

106 

104 

102 

80 

169 

166 

164 

162 

41 

110 

108. 

106 

104 

81 

170 

167 

165 

163 

42 

111 

109 

107 

105 

82 

172 

169 

167 

165 

43 

113 

111 

109 

107 

83 

173 

170 

168 

166 

44 

114 

112 

110 

108 

84 

175 

172 

170 

168 









INTELLIGENCE QUOTIENTS TABLES 


249 


INTELLIGENCE QUOTIENTS— AGES 1^-0 TO 12-11 


PEBFOBMAKCB SCALE 


WEIGHTED 

SCOBS 

12-0 

I.Q. 

12-0 

I.Q. 

12—6 

I.Q. 

12—9 

I.Q. 

WEIGHTED 

SCOBS 

12—0 

I.Q. 

12-3 

I.Q. 

12— « 
I.Q. 

12—9 

LQ. 

5 

48 

46 

44 

42 

45 

108 

106 

104 

102 

6 

50 

48 

46 

44 

46 


107 

105 

103 

7 

51 

49 

47 

45 

47 

111 

109 

107 

105 

8 

53 

51 

49 

47 

48 

112 

110 

108 

106 

9 

54 

52 

50 

48 

49 

114- 

112 

110 

108 

10 

56 

54 

52 

50 

50 

115 

113 

111 

109 

11 

57 

55 

53 

51 

51 

117 

115 

113 

111 

12 

58 

56 

55 

53 

52 

118 

116 

114 

112 

13 

60 

58 

56 

54 

53 

120 

118 

116 

114 

14 

62 

60 

58 

56 

54 

121 

119 

117 

115 

15 

63 

61 

59 

57 

55 

123 

121 

119 

117 

16 

65 

63 

61 

59 

56 

124 

122 

120 

118 

17 

66 

64 

62 

60 

57 

126 

124 

122 

120 

18 

68 

66 

64 

62 

58 

127 

125 

123 

121 

19 

69 

67 

65 

63 

59 

129 

127 

125 

123 

20 

70 

68 

67 

65 

60 

130 

128 

126 

124 

21 

72 

70 

68 

66 

61 

132 

130 

128 

126 

22 

73 

71 

70 

68 

62 

133 

131 

129 

127 

23 

75 

73 

71 

69 

63 

135 

133 

131 

129 

24 

77 

75 

73 

71 

64 

136 

134 

132 

130 

25 

78 

76 

74 

72 

65 

138 

136 

134 

132 

26 

79 

77 

76 

74 

66 

139 

137 

135 

133 

27 

81 

79 

77 

75 

67 

141 

139 

137 

135 

28 

82 

80 

79 

77 

68 

142 

140 

138 

136 

29 

84 

82 

80 

78 

69 

144 

142 

140 

138 

30 

85 

83 

82 

80 


145 

143 

141 

139 

31 

87 

85 

83 

81 

71 

146 

144 

143 

141 

32 

88 

86 

85 

83 

72 

148 

146 

1 144 

142 

33 

90 

88 

86 

84 

73 

150 

148 

146 

144 

34 

91 

89 

87 

85 

74 

151 

149 

147 

145 

35 

93 

91 

89 

87 

75 

153 

151 

i 149 

147 

36 

94 

92 

90 

88 

76 

154 

152 

150 

148 

37 

96 

94 

92 

90 

77 

155 

153 

151 

149 

38 

97 

95 

94 

92 

78 

157 

155 

153 

151 

39 

99 

97 

95 

93 

79 

159 

157 

155 

153 

40 

100 

98 

96 

94 

80 

160 

158 

156 

154 

41 

102 

! 100 

98 

96 

81 

161 

159 

158 

156 

42 

103 

1 101 

99 

1 97 

82 

163 

161 

159 

157 

43 

105 

1 103 

101 

99 

83 

164 

162 

; 160 

158 

44 

106 

! 104 

102 

100 

84 

166 

164 

162 

i 160 
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INTELLIGENCE QUOTIENTS— AGES 14—0 TO 14—11 


FERFOBMAMCE SCALE 


WSXCrStTBD 

SCOBB 

14-0 

I.Q. 

14—3 

I.Q. 

14-6 

I.Q. 

WEIQHTBD 

fiCORB 

14—0 

I.Q. 

14—3 

LQ. 

14—6 

LQ. 

5 

37 

36 

35 

45 

96 

95 

95 

6 

38 

37 

37 

46 

98 

97 

97 

7 

40 

39 

38 

47 

99 

98 

98 

8 

41 

40 

40 

48 

101 

100 

100 

9 

42 

41 

41 

49 

102 

101 

101 

10 

44 

43 

43 

50 

104 

103 

103 

11 

46 

44 

44 

61 

105 

104 

104 

12 

47 

46 

46 

62 

107 

106 

106 

13 

48 

47 

47 

63 

108 

107 

107 

14 

50 

49 

49 

54 

no 

109 

109 

15 

51 

60 

50 

65 

111 

no 

110 

16 

63 

62 

52 

66 

113 

112 

112 

17 

64 

53 

63 

67 

114 

113 

113 

18 

56 

55 

55 

58 

116 

115 

115 

19 

67 

66 

66 

59 

117 

116 

116 

20 

69 

68 

58 

60 

119 

118 

117 

21 

60 

69 

59 

61 

120 

119 

119 

22 

62 

61 

61 

62 

122 

121 

120 

23 

63 

62 

62 

63 

123 

122 

122 

24 

65 

64 

64 

64 

125 

124 

123 

25 

66 

65 

65 

65 

126 

125 

125 

26 

63 

67 

67 

66 

128 

127 

126 

27 

69 

68 

68 

67 

129 

128 

128 

23 

71 

70 

70 

68 

132 

131 

131 

29 

72 

71 

71 

69 

133 

132 

132 

30 

74 

73 

73 

70 

135 

134 

134 

31 

75 

74 

74 

71 

136 

135 

135 

32 

77 

76 

76 

72 

138 

137 

137 

33 

78 

77 

77 

73 

139 

138 

138 

34 

80 

79 

79 

74 

141 

140 

140 

35 

81 

80 

80 

75 

142 

141 

141 

86 

83 

82 

82 

76 

144 

143 

143 

37 

84 

83 

83 

77 

145 

144 

144 

38 

86 

85 

85 

78 

147 

146 I 

146 

39 

87 

86 

86 

79 

148 

147 

147 

40 

89 

88 

88 

80 

149 1 

149 

149 

41 

90 

89 

89 

81 

151 

150 

150 

42 

92 

91 

91 

82 

152 

152 

152 

43 

93 

92 

92 

83 ! 

154 

153 

153 

44 

95 

94 

94 

84 

155 

155 

155 
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INTELLIGENCE QUOTIENTS— AGES 16 TO 19 

FISBFORUAKCII SCALB 


WBIOHTBD 

SCOBB 

15 

I.Q. 

16 

I.Q. 

17-lfl 

I.Q. 

WSXQBTBD 

SCOBB 

15 

LQ. 

15 

LQ. 

17-19 

I.Q. 

5 

34 

*34 

34 

45 

93 

92 

91 

6 

36 

36 

36 

46 

95 

93 

92 

7 

37 

37 

37 

47 

96 

95 

94 

8 

39 

39 

39 

48 

98 

96 

95 

9 

40 

40 

40 

49 

99 

98 

96 

10 

42 

42 

41 

50 

101 

99 

98 

11 

43 

43 

43 

51 

102 

100 

99 

12 

45 

44 

44 

52 

104 

102 

101 

13 

46 

46 

46 

53 

105 

103 

102 

14 

48 

47 

47 

54 

107 

105 

104 

15 

49 

49 

48 

55 

108 

106 

105 

16 

50 

50 

50 

56 

110 

108 

106 

17 

62 

52 

51 

57 

111 

109 

108 

18 

53 

53 

53 

58 

113 

110 

109 

19 

55 

54 

54 

59 

114 

112 

111 

20 

56 

56 

56 

60 

116 


112 

21 

58 

57 

57 

61 

117 

Bfin 

113 

22 

59 

59 

58 

62 

119 


115 

23 

61 

60 

60 

63 

120 

118 

116 

24 

62 

62 

61 

64 

122 

119 

118 

25 

64 

63 

63 

65 

123 

120 

119 

26 

65 

65 

64 

66 

124 

122 

121 

27 

67 

66 

65 

67 

126 

123 

122 

28 

68 

67 

67 

68 

127 

125 

123 

29 

70 

69 

68 

69 

129 

126 

125 

30 

71 

70 

70 

70 

130 

128 

126 

31 

73 

72 

71 

71 

132 

129 

128 

32 

74 

73 

72 

72 

133 

131 

129 

33 

76 

75 

74 

73 

135 

132 

130 

34 

77 

76 

75 

74 

136 

133 

132 

35 

79 

77 

77 

75 

138 

135 

133 

36 

80 

79 

78 

76 

139 

136 

135 

37 

82 

80 

80 

77 

141 

138 

136 

38 

83 

82 

81 

78 

142 

139 

137 

39 

85 

83 

82 

79 

144 

141 

139 

40 

86 

85 

84 

80 

145 

142 

140 

41 

87 

86 

85 

81 

147 

143 

. 142 

42 

89 

87 

87 

82 

148 

145 

143 

43 

90 

89 

88 

83 

150 

146 

145 

44 

92 

90 

89 

84 

151 

148 

146 
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INTELLIGENCE QUOTIENTS— AGES 20 TO 34 


PERyORMANCB SCALE 


WSIGHTBD 

BCOBB 

20-24 

I.Q. 

25-29 

I.Q. 

30-34 

LQ. 

WSIGHTSD 

SCOBS 

20-24 

I.Q. 

25-29 

LQ. 

30-34 

LQ. 

5 

37 

42 

48 

45 

92 

96 

99, 

6 

39 

44 

49 

46 

93 

97 

101 

7 

40 

45 

51 

47 

95 

98 

102 

8 

41 

46 

52 

48 

96 

100 

103 

9 

43 

48 

53 

49 

97 

101 

104 

10 

44 

49 

65 

50 

99 

102 

106 

11 

46 

SO 

56 

51 

100 

104 

107 

12 

47 

52 

57 

52 

102 

105 

108 

13 

48 

53 

58 

53 

103 

106 

110 

14 

50 

54 

60 

54 

104 

108 

111 

15 

51 

56 

61 

55 

106 

^ 109 

112 

16 

52 

57 

62 

56 

107 

110 

114 

17 

54 

58 

64 

57 

109 

112 

115 

18 

55 

60 

65 

58 

110 

113 

116 

19 

56 

61 

66 

59 

111 

114 

117 

20 

58 

62 

67 

60 

113 

116 

119 

21 

59 

64 

69 

61 

114 

117 

120 

22 

61 

65 

70 

62 

115 

118 

121 

23 

62 

66 

71 

63 

117 

120 

122 

24 

63 • 

68 

73 

64 

118 

121 

124 

25 

65 

69 

74 

65 

119 

122 

125 

26 

66 

70 

75 

66 

121 

124 

126 

27 

67 

72 

76 

67 

122 

125 

127 

28 

69 

73 

78 

68 

124 

126 

129 

29 

70 

74 

79 

69 

125 

128 

130 

30 

72 

76 

80 

TO 

126 

129 

131 

31 

73 

77 

81 

71 

128 

130 

133 

32 

74 

78 

83 

72 

129 

132 

134 

33 

76 

80 

84 

73 

130 

133 

135 

34 

77 

81 

85 

74 

132 

134 

137 

35 

78 

82 

87 

75 

133 

136 

138 

36 

80 

84 

88 

76 

135 

137 

139 

37 

81 

85 

89 

77 

136 

138 

140 

38 

83 

86 

90 

78 

137 

140 

142 

39 

84 

88 

92 

79 

139 

141 

143 

40 

85 

89 

93 

80 

140 

142 

144 

41 

87 

90 

94 

81 

141 

144 

146 

42 

88 

92 

96 

82 

143 

145 

147 

43 

89 

93 

97 

83 

144 

146 

148 

44 

91 

94 

98 

84 

146 

148 

149 
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MEASUREMENT OF ADULT INTELLIGENCE 


INTELLIGENCE QUOTIENTS— AGES 35 TO 49 

PJBRFOBMANCB SCALE 
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Abilities, effect of mental disorders on, 
146 

Ability, different types of, 4, 5 
Adult I.Q/s, 28, 30, 189 
Adult mental age, 15, 22, 30 
Age, as factor in calculating adult I.Q.*s, 
12, 28, 30, 31; as factor in mental 
deterioration, 56, 60; as factor in 
standardization, 103; in relation to 
brain weight, 59-61; in relation to 
decline of various mental functions, 
56-61, 64; of maximum mental abil- 
ity, 25, 56, 200; regression equation 
for, on test scores, 143; tables of 
mean test scores, 118-119, 222 
Alcoholic psychosis, memory disturb- 
ances in, 67, 84 
Alexander, W. P., 9, 10 
Alpha Intelligence tests, 14, 16, 32, 33, 
38, 53, 76, 78, 80, 81, 90, 108, 117, 127, 
129, 140 

Altus, Lt. W. D., AGD, U.S.A. 172 
Aphasia, disturbance of Gestalt func- 
tion in, 69 

Arithmetic reasoning test, description 
of, 82; directions for, 175; scoring of, 
175 

^rmy Memoirs, 87 

Army Performance tests, 14, 16, 88, 95 
Arthur, G., 92, 126 

Average intelligence, statistical limits 
of, 39-41, 190 

Babcock, H., 63, 68 
Balinsky, B., 130, 157 
Bellevue Intelligence Scales, directions 
for administering, 171-213 
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mean scores and S.D.'s of Full Scale, 
122; of Performance Scale, 123; of 
Verbal Scale, 122; of subtests, 222; 
method of weighting, 117 


Bender, L., 68, 69 

Beta Intelligence tests, 14, 16, 33, 53, 76, 
90, 95 

Binet, A., contributions of, 6, 14, 19 
Block design test, description of, 91-94; 
directions for^ 183-185; scoring of, 
185 

Borderline intelligence, statistical limits 
of, 40, 190 
Bown, M.D., 149 

Brain weight, variations with age, 59, 61 
Bright Normal, definition of, 41; statis- 
tical limits of, 41, 190 
Burt, C., 25, 104 

C.A., psychometric definition of, 23 
Cattell. McK., 58 

Clinical features of Bellevue Scale, 146- 
148 
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each age, 124; of weighted score, 
121 

Common factor, 5, 7 
Comprehension test, description of, 80- 
81; directions for, 173-174; scoring 
of, 174 

Conrad, H. S-, 29, 59, 106 
Cornell-Coxe Performance tests, 87-95 
Correlations between Bellevue and other 
scales, 134; between Bellevue sub- 
tests and total scale, 224 
Court of General Sessions, use of Belle- 
vue Tests, 136 

Criteria for scoring, for general com- 
prehension test, 191-198; for simi- 
larities test, 198-202; for vocabu- 
lary test, 202-213 

Criticisms of I.Q. discrepancies, 136-140 

Curve of mental growth, 27 

Curve of mental growth and decline, 29 

DeCroly, 87 
Dementia praecox, 152 
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INDEX 


Deterioration ratio, 65-67 
Diagnostic features of Bellevue Scale 
146-167 

Differences between Bin^t and Bellevue 
I.Q.’s, 129, 143 

Differential-test-score method, 63 
Digit symbol test, description of, 94-96; 

directions for, 185; scoring of, 185 
Digit-span test, directions for, 176; scor- 
ing of, 176 

Discrepancies betwee'iT Verbal and Per- 
formance I.Q.’s, 138 
Doll, E. A., 43, 47 

Drive, as factor in general intelligence, 
3, 10, 47 

Dull-normal intelligence, statistical 
limits of, 40, 41, 190 

Education, as factor in intelligence, 103, 
104; of U. S. adult population, 105, 
106; of Bellevue samplings, 105, 114 
Efficiency quotients, 46, 47, 221 
Emotional defectives, 53 / 

Emotional deterioration, 55 

‘T' factor, 10 

Feature profile test, description of, 96- 
98; directions for, 181, 182; scoring 
of, 183 

Fishbein, S., 134 

Frequency curve, form of for Bellevue 
Scale, 126-127 

Full Scale, tests comprising, 77; I.Q. 

tables for, 231-238 
Functional equivalence of tests, 6 
Functional unities, 10 

“g^^ 7, 8,9,10, 11 
Galton, F., 43, 44 

General intelligence, definition of, 3; 
factors in, 3, 4, 7; non-intellective 
factors in, 10, 11 
General paresis, 67 
Gestalt, disturbances of, in old age, 69 
Gestalt tests, 69 
Gilbert, J. D., 63 
Goldfarb, W., 101 
Goldstein, K., 93 


Haggerty Delta test, 90 
Halpern, F., 156 

Hand test, description of, 96; directions 
for, 182; scoring of, 183 
*^Hard’^ signs, 160 • 

Healy Picture Completion test, 38, 89 

Heinis, H., 141 

Henmon-Nelson test, 135 

Hollingworth, L. S., 46 

Hull, C. L., 117 

Humphreys, E. J., 115 

Hunt, H. J., 63 

Illustrative cases, 51, 130-132, 160-166 
Information test, description of, 77-80; 

directions for, 172; scoring of, 173 
Intellectual disabilities and mental dis- 
order, 66-69; 148, 150-167 
Intelligence, classification of, 36, 47, 48; 
according to Terman, 37; statistical 
basis of classification, 44; zero point 
of, 32, 33 

Intelligence tests, need for adult types 
of, 12, 13-18 

Intelligence test scores, variation with 
age, 57-60 

Intertest correlations, 223, 224 
I;Q., arbitrary character of, 34; calcula- 
tion of, 24, 34, 188, 219; constancy 
of, 24r-26; 30, 31; definition of, 24, 30; 
limitations of, 25-27, 30, 31; value 
of, 24, 46; variability of, for Binet 
scales, 26 , 

I.Q/s, distribution of, 127; interpreta- 
tion of, 190-191 ; means and standard 
deviations of, 122-123; method of 
finding, 188, 189; percentile ranks of, 
42; tables of, 231-254; variability of, 
with age, ‘121 
Israel, H., 130, 157 

Jones, H. E., 29, 59, 106 

Kahn, E., 147 
Bander, E. F., 115 
Kohs, S. C., 91, 92, 94 
Kuhlmann, F., 16, 24, 141, 191 
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Levi, J., 214 
Lorge, I., 134 

M,A., advantages of, 43; assumed aver- 
age for adults, 14, 15; definition of, 
19, 20, 22; equivalents for Bellevue 
scales, 120; limitations of, 21, 22; 
linear relation of, to C.A., 28; upper 
limits of, 20 
Magaret, A., 157, 215 
Manikin test, Bellevue form of, 96, 97; 
directions for, 181-182; scoring of, 
182 

Memoirs of National Academy of Sci- 
ence, 109 

Memory span for digits, description of, 
83-85; directions for, 176-177; scor- 
ing of, 176 

Mental defectives, incidence of, in U. S. 
Army, 16; in U. S. population, 40, 
50; test patterns of, 151, 157 
Mental deficiency, concept of, 49; defini- 
tion of, 36, 49, 50, 51; diagnosis of, 
48, 53; limits of, 37, 39, 40, 190; not a 
unitary entity, 53 

Mental deterioration, definition of, 54; 
measurement of, 56, 58, 61; tests for, 
63-69 

Mental growth, curve of, 27 

Mental growth and decline, curve of, 29 

Mental products, 4 

Mental ratio, 25 

Miles, W. R., 29 

Morgan Mental Ability test, 135 

Nadel, A, B,, 92 
National Intelligence tests, 80 
Natural selection, influence of, on intel- 
ligence, 121 

Negroes, reason for omission from Belle- 
vue scale standardization, 107 
Neurotics, test patterns of, 151, 156, 165 
166 

New Yorker, 88 

New York Public Schools, superinten- 
dent’s report of, 113 
Non-measurable factors in intelligence, 
48 


Object assembly test, description of, 
96-98; directions for, 181-182; scor- 
ing of, 182-183 

Organic brain disease, intellectual dis- 
abilities in, 66, 67, 148; test pattern 
of, 148, 150, 153 
Otis tests of mental ability, 21 

P.E., as basis of defining I.Q.’s, 34, 35; 
as basis for intelligence classifica- 
tion, 39, 40, 81 
Pearl, R., 59, 61 
Per cent-of-average ratio, 141 
Performance scale, tests comprising, 77; 
description of, 87-98; discrepancies 
between it and Verbal scale, 136- 
140; I.Q. tables for, 247-254 
Performance tests, reason for including 
in general scale, 11; comparison with 
Verbal scales, 126; correlations with 
Full and Verbal scales, 124; decline 
with age, 64-67 ; in clinical diagnosis 
146, 147, 148, 153 

Picture arrangements test, description 
of, 87-89; directions for, 179-180; 
scoring of, 180-181 

Picture completion test, description of, 
89-91; directions for, 179; scoring 
of, 178 

Pintner-Paterson Performance scale, 16, 
38, 96, 97, 137 
Pintner, R,, 16, 112 
Practical intelligence, 5 
Psychopaths, 155, 164-165, test patterns 
of, 147, 151, 155-156 

Queens General Hospital, use of Bellevue 
scale, 136 

Rabin, I., 159 

Race, as a factor in intelligence, 107, 
108, 147 

Raw scores, definition of, 181 
Reliability of Bellevue scales, 133 
Rossle and Routlet, 60 
Ruger, H. A., 121 
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Segal, D., 105 
Senility, 55, 57, 59, 60, 67 
Sequin Form Board, 18 
Sex delinquents, 51, 52 
Sex differences, 106 
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Ship test, 20, 74 
Shipley, W. C., 63 

Similarities test, description of, 85-87; 

directions for, 177 ; scoring of, 178 
^^Signs”, psychometric definition of, 
152; method of ‘'counting'', 152, 159; 
list of, for various clinical groups, 
150-151 

Social defectives, 53 
Social intelligence, 5, 88, 89 
"Soft" signs, definition of, 160 
Soglow, 88 

Spearman, C., 5, 6, 7, 8, 9, 10, 83 
Specific factors, 5, 10 
Speed as factor in test scores, 18 
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122-123; of Bellevue subtests, 222; 
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Thurstone, L. L., 9, 32 
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Two-factor theory, 5, 6, 8, 9 
Trabue Sentence Completion test, 38 
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"v" factor, 10 
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Verbal Scale, tests comprising, 77; LQ. 
tables for, 239-246; means and S.D.'s 
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Vital capacity, 55; variation with age, 56 
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directions for, 180; in schizophrenia, 
100; scoring of, 186 
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Weighted scores, for Bellevue scales, 188; 
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White, W. A., 100 
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